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^ Indian WVYYESKAHV HCSVKSEHSA AAKTGG SEQ ID NO: 23 

Sonic WVYYESKAHI HCSVKAENSV AAKSGG SEQ ID NO. 24 

^ Desert WVYYESRNHV HVS VKADNSL AVRAGO SEQ ID NO. 25 

Gap(s), indicated by added to facilitate alignoiem 
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(57) Abstract: Hedgehog agonists and antagonists can be used to regulate angiogenesis, and have utility in treating tissue repair and 
cancer, and to prevent angiogenesis driven pathologies. 
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with sequence Us ting part of description published sepa- 
rately in electronic form and available upon request from 
the International Burettu 



For two-letter codes and other abbreviations, refer to the ''Guid- 
ance Notes on Codes and Abbreviations'* iq>pearing at the begin- 
ning of each regular issue of the PCT Gazette. 
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ANGIOGENESIS-MODULATING COMPOSITIONS AND USES 
Background Art 

5 Hedgehog proteins act as morphogens in a wide variety of tissues during 

embryonic development Obigham, 1995; Perrimon, 1995; Johnson and Tabin, 1997; 
I]bixunerschmidt et al., 1997). Vertebrate hedgehogs are crucial to a number of 
epithelial-mesenchymal inductive interactions during neuronal development, limb 
development, limg, bone, hair follicle and gut formation (Ericson et al., 1995; Roberts et 

10 ai., 1995; Apelqvist et al., 1997; Ericson et al., 1997; Hammerschmidt et al., 1997; 
Johnson and Tabin, 1995; Pepicelli et al., 1998; Litingtung et al., 1998; Roberts et al., 
1998; Dodd et al., 1998; Dockter, 2000). Mammalian hedgehog genes consist of sonic, 
indian and desert which are highly conserved between species (Zardoj^ 1996). Sonic 
hedgehog (shh) is expressed widely during development and sonic null mice are 

15 embryonic lethal with multiple defects beginning early to midgestation (Bitgood and 
McMahon, 1995; Chiang et al., 1996; Litingtung et al„ 1998; St-Jacques et al., 1998). 
Indian hedgehog (ihh) is expressed less widely and indian null mice survive till late 
gestation. However, Ihh null mice exhibit severe stunting of skeletal growth which 
correlates to the role of Ihh in regulating bone growth plate (St-Jacques et al., 1999; 

20 Karp et al., 2000). Desert hedgehog (dhh) is the most restricted in expression and Dhh 
null mice are viable, but as expected from the expression pattem, male gonads do not 
develop completely and the peripheral nerves develop in a disorganized fashion 
(Bitgood et al., 1996; Parmantier et al., 1999). 

Hedgehog signalling occurs through the interaction of hedgehog protein with the 

25 hedgehog receptor, patched (Ptc) and this interaction's modulation of the co-receptor 
smootfaened (Smo). The mammalian genome contains 2 patched genes, ptcl and ptc2, 
both of which encode 12 transmembrane proteins containing a sterol sensing domain 
(Motoyama et al, 1998; Carpenter et al, 1998). The interaction of Hh and Ptc inactivates 
the repression of smoo&ened (Smo), a 7 transmembrane protein which then leads to 

30 activation of fused (Fu), a serine-threonine kinase, and the disassociation of a 
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transcriptioii factor, Gli, from the microtuble-associated Fu-Gli-Su(fu) complex. The 
uncomplexed Gli protein is transported to the nucleus where it activates downstream 
target genes of the hedgehog pathway including the ptcl and glil genes (Ding et al., 
1999; Murone et al, 1999a; Murone et al, 1999b; Pearse et al., 1999; Stone et al., 1999; 

5 Hynes et al, 2000). 

Hedgehog genes have so far not been implicated directly in embryonic or adult 
angiogenesis. No vascular defects have bem reported in shh, ihh or dhh knockout mice. 
However, we show here that cells in the adult vasculature both express ptcl and can 
respond to exogenous hedgehog and, more importantly, hedgehog is able to induce 

10 robust neovascularization in the corneal pocket model of angiogenesis. The angiogenic 
response to hedgehog appears to occur through the activation of mesenchymal cells to 
produce VEGFs and Angiopoietins. 

Angiogenesis, the process of sproutmg new blood vessels from existing 
vasculature and arteriogenesis, the remodeling of small vessels into larger conduit 

15 vessels are both physiologically important aspects of vascular growth in ad\ilt tissues 
(Klagsbrun and D'Amore, 1991; Folkman and Shing, 1992; Beck and D'Amore, 1997; 
Yancopoulos et al., 1998; Buschman and Schaper, 2000). These processes of vascular 
growth are required for beneficial processes such as tissue repair, wound healing, 
recovery from tissue ischemia and menstrual cycling. They are also required for the 

20 development of pathological conditions such as the growth of neoplasias, diabetic 
retinopathy, rheumatoid arthritis, psoriasis, certain forms of macular degeneration, and 
certain inflammatory pathologies (Cherrington et al., 2000). 

The abiUty to stimulate vascular growth has potential utility for treatment of 
ischemia-induced pathologies such as myocardial infarction, coronaiy artery disease, 

25 peripheral vascular disease, and stroke. The sprouting of new vessels and/or the 
expansion of small vessels in ischemic tissues prevents ischemic tissue death and 
induces tissue repair. Certain growth factors such as those in the vascular endothelial 
growth factor (VEGF) and fibroblast growth factor (FGF) families are able to stimulate 
vascular growth by acting on endotheUal cells to induce angiogenesis. Other fiictors 
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have also been sho^ to have angiogenic and arteriogenic activities such as MCPI 
(Buschman and Schaper, 2000) and angiopoietins. In preclinical models of myocardial 
infarction, both FGFs and VEGFs have been able to improve myocardial 
revascularization and function (Yanagisawa-Miwa et al, 1992; Battler et al., 1993; 
5 Harada et al., 1994; Banai et al., 1994; linger et al., 1994; Mesri et al., 1995; Pearlman 
et al., 1995; Landau et al, 1995; Lazarous et al., 1996; Engler, 1996; Magovem et al., 
1997; Shou et al., 1997). Also in models of peripheral vascular disease, VEGF and other 
angiogenic factors are able to induce angiogenesis and improve vascular perfusion of 
the ischemic limb (Majesky, 2000; Takeshita et al, 1996 and 1994; Rivaid et al., 1998 

10 and 1999, Isner et al, 1996). 

A number of these factors are also implicated in vascular growth in pathological 
conditions such as tumor expansion, diabetic retinopathy and rhematoid arthritis. The 
inhibiton of vascular growth in these contexts has also shown beneficial effects in 
preclinical animal models (Klohs and Hamby, 1999; Zhu and Witte, 1999; Cherrington 

15 et al., 2000). For example, inhibition of angiogenesis by blocking vascular endothelial 
growth factor or its receptor has resulted in inhibition of tumor growth and in 
retinopathy (Fong et al., 1999; Wood et al., 2000; Ozaki et al., 2000). Also, the 
development of pathological pannus tissue in rheumatoid arthritis involves angiogenesis 
and can be blocked by inhibitors of angiogenesis (Peacock et al., 1995; Storgard et al., 

20 1999). 

Thus, the induction of angiogenesis and vascular growth is beneficial for tissue 
repair and would healing wiiereas inhibition of angiogenic growth factors can prevent 
angiogenesis driven pathologies. It would be useful to develop novel ^erapeutics that 
modulate angiogenesis. 

25 

Summary Of The Invention 

Hedgehog proteins are angiogenic growth factors which can have utility in 
treatir^ tissue repair and ischemia and that inhibition of the hedgehog proteins and the 
hedgehog pathway can prevent angiogenesis drivra pathologies. 
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Brief Description of Drawings 

Figure 1 : Alignment of N-tenninal fragments of Human Hedgehog Proteins 
5 Figure 2: Consensus sequence of a hedgehog protein suitable for use in 

developing the conjugated proteins of the invention, antagonist, where "Xaa" indicates • 
amino acids that differ between the Sonic, Indian and Desert hedgehog protems. 

Detailed Description Of The Invention 

10 The present invention relates to the use of hedgehog protein, DNA, or other 

hedgehog therapeutic as an agent to induce the growth of new blood vessels, ie 
angiogenesis, arteriogenesis or vascular growth in adult tissues where the induction of 
angiogenesis has thempeutic value. The present invention also relates to the use of 
inhibitors of hedgehog protein or signaling to prevent angiogenesis contributing to 

15 pathological conditions such as neoplasia (tumors and gliomas), diabetic retinopathy, 
rheumatoid ardiritis, osteroarthritiLs, macular degeneration, psoriasis, ulcemtive colitis, 
Chrohn's disease, and inflammation. 

All references cited in the Detailed Description are incorporated herein by 
references, unless stipulated otherwise. The following terms are used herein: 

20 L Definitions 

"Angiogenesis" is defined as any alteration of an existing vascular bed or the 
formation of new vasculature which benefits tissue perfusion* Ibis includes the 
formation of new vessels by sprouting of endothelial cells from existing blood vessels or 
the remodeling of existing vessels to alter size, nmturity. direction or flow properties to 
25 improve blood perfusion of tissue. 

Mesenchynoial cells are defined as cells of mesenchymal origin including 
fibroblasts, stromal cells, smooth muscle cells, skeletal muscle cells, cells of osteogenic 
origin such as chondrocytes, cells of hemaeopoietic origin such as monocytes, 
macrophages, lymphocytes, granulocytes and cells of adipose origin such as adipocytes. 
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A hedgehog therapeutic, whether it is a. hedgehog angonist or hedgehog 
antagonist is said to have '^erapeutic efficacy'' in modulating angiogenesis and an 
amount of the therapeutic is said to be a ''angiogenic modulatory amount", if 
administration of that amount of the therapeutic is sufficient to cause a significant 

s modulation (i.e., increase or decrease) in angiogenic activity when admiiustered to a 
subject (e.g., an animal model or human patient) needing modulation of angiogenesis. 

As used herein, a hedgehog therapeutic of the invention is an ''agonist'' if it 
"modulates" hedgehog biological activity (i.e., elicits, allows and/or enhances hedgehog 
biological activity). For the purposes of the invention an agonist also refers to an agent, 

10 e.g., a polypeptide such as an hedgehog or patched or a small organic molecule lAiiich 
can elicit, allow and/or enhance hedgehog and/or patched-mediated binding or \siiich 
can otherwise modulate hedgehog and/or patched function, e.g., by activating hedgehog- 
ligand mediated hedgehog signal transduction. Such an agonist of the hedgehog^patched 
interaction is an agent which has one or more of the following properties: (1) it coats, or 

15 binds to, a hedgehog protein associated with an extracellular matrix, e.g., heparin, 
heparin proteoglycans, collagen, fibronectin, vitronectin, thrombospondin, or on the 
sur&ce of a hedgehog bearing or secreting cell with sufficient specificity to modulate a 
hedgehog-ligand/hedgehog receptor interaction, e.g., the hedgehog/patched-smoothened 
interaction; (2) it coats, or binds to, a hedgehog on the surface of a hedgehog- bearing or 

20 secreting cell with sufficient specificity to modify, and preferably to modulate, 
transduction of a hedgehog-mediated signal e.g., hedgehog/patched-smoothened - 
mediated signaling; (3) it coats, or binds to, a hedgehog receptor or co-receptor, (e.g., 
patched, smoothened or a heparin proteoglycan) in or on cells with sufficient specificity 
to modulate the hedgehog/patched-smoothened interaction; (4) it coats, or binds to, a 

25 hedgehog receptor (e.g., patched or smoothened) in or on cells with sufficient specificity 
to modify, and prefembly to modulate, transduction of hedgehog receptor mediated 
hedgehog signaling, e.g., patched, smoothened, fused or gli-mediated hedgehog 
signaling. 
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In preferred embodiments an agonist has one or both of properties 1 and 2. In 
other preferred embodiments the agonist has one or both of properties 3 and 4. 
Moreover, more than one agonist can be administered to a patient, e.g*, an agent AA^ch 
binds to hedgehog can be combined with an agent which bmds to patched. Moreover, a 
5 hedgehog therapeutic is an "agonist" if it modulates angiogenesis in such a way as to 
enhance, elicit, accelerate or increase angiogenesis, regardless of the mode of action of 
such ther£^eutic. 

As used herein, a hedgehog therapeutic is an "antagonist" if it de-activates the 
hedgehog receptor or inhibits its activity or inhibits activity of the hedgehog protein. 

10 Such an antagonist may additionally have one or more of the following properties: (1) it 
coats, or binds to, a hedgehog protein on the sxirface of a hedgehog bearing or secreting 
cell with sufScient specificity to de-activate or inhibit a hedgehog-ligand/hedgehog 
interaction, e.g., the hedgehog/patched interaction; (2) it coats, or binds to, a hedgehog 
protein on the surface of a hedgehog- bearing or secreting cell with sufficient specificity 

IS to modify, and preferably to de-activate or inhibit, transduction of a hedgehog-mediated 
signal e.g., hedgehog/patched, smootiiened, fiised, or gli -mediated signaling; (3) it 
coats, or binds to, a hedgehog receptor or coreceptor (e.g., patched or smoothened) in or 
on cells with sufficient specificity to de-activate or inhibit the hedgehog /patched 
interaction; (4) it coats, or binds to, a hedgehog receptor or co-receptor (e.g., patched or 

20 smoothened) in or on cells with sufficient specificity to modify, and preferably to de- 
activate or inhibit transduction of hedgehog receptor mediated hedgehog signaling, e.g., 
patched-mediated hedgehog signaling. In preferred embodiments an antagonist has one 
or both of properties 1 and 2. In other pref^ed embodiments the antagonist has one or 
both of properties 3 and 4. Moreover, more than one antagonist can be administered to a 

25 patient, e.g., an agent which binds to hedgehog can be combined with an agent which 
binds to patched. Moreover, a hedgehog therapeutic is an "antagonist* if it modulates 
angiogenesis in such a way as to inhibit, decelerate, reverse or otherwise slow 
angiogenesis, regardless of the mode of action of such therapeutic. For example, 
antagonist molecules may be antibody homologs (defined below), certain fragments of 
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hedgehog, or small organic molecules that may be admhdstered and modulate hedgehog 
binding sites on cells. 

As discussed herein, the hedgehog therapeutics (i.e., antagonists or agonists) that 
can be linked or otherwise conjugated to, for instance, an antibody homolog such as an 
5 immunoglobulin or fragment thereof are not limited to a particular type or structure of 
hedgehog or patched or other molecule so that, for purposes of the invention, any agent 
capable of forming a chimeric protein and capable of effectively modulating hedgehog 
is considered to be an equivalent of the therapeutics used in the examples herein. 

As used herein, the term '"antibody homolog" includes intact antibodies 
10 consisting of immunoglobulin light and heavy chains linked via disulfide bonds. The 
term "antibody homolog" is also intended to encompass a hedgehog therapeutic 
comprising one or more polypeptides selected from immunoglobulin light chains, 
immunoglobulin heavy chains and antigen-binding fragments thereof which are capable 
of binding to one or more antigens (i.e., hedgehog or patched). The component 
15 polypeptides of an antibody homolog composed of more than one polypeptide may 
optionally be disulfide-bound or otherwise covalmtly crosslinked. Accordingly, 
therefore, "antibody homologs" include intact immunoglobulins of types IgA, IgG, IgE, 
IgD, IgM (as well as subtypes thereof), wherein the light chains of the immunoglobulin 
may be of types kappa or lambda or portions of intact antibodies that retain antigen- 
ic binding specificity, for example. Fab fragments. Fab' fi-agments, F(ab')2 firagments, F(v) 
fragments, heavy chain monomers or dimers, light chain monomers or dimers, dimers 
consisting of one heavy and one light chain, and the like. 

As used herein, a '"humanized antibody homolog" is an antibody homolog, 
produced by recombinant DNA technology, in which some or all of the amino acids of a 
25 human immunoglobulin light or heavy chain that are not required for antigen binding 
have been substituted for the corresponding amino acids fix)m a nonhimian mammal lan 
immunoglobulin light or heavy chain. A '"himian antibody homolog" is an antibody 
homolog in which all the amino acids of an immunoglobulin light or heavy chain 
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(regardless of whether or not they are required for antigen hinding) are derived from a 
human source. 

"amino acid"- a monomeric unit of a peptide, polypeptide, or protein. There are 
twenty amino acids found in naturally occurring peptides, polypeptides and proteins, all 

5 of which are L-isomers. The term also includes analogs of the amino acids and D- 
isomers of the protein amino acids and their analogs. 

A hedgehog therapeutic has "biological activity" if it has at least one of the 
following properties: (i) it has tiie ability to bind to its receptor, patched or it encodes, 
upon expression, a polypeptide that has this characteristic; and/or (ii) it may induce 

10 alkaline phosphatase activity in C3H10T1/2 cells. The hedgehog therapeutic protein 
meeting this functional test of "biological activity" may meet the hedgehog consensus 
criteria as dejSned herein in Figure 2 (SEQ ID NO: 26). This term "biological activity" 
includes antagonists and agonists. 

The term ^'bioavailability" refers to the ability of a compound to be absorbed by 

15 the body after administration. For instance, a j&rst compound has greater bioavailability 
flian a second compound if, when both are administered in equal amounts, the first 
compound is absorbed into the blood to a greater extent tiian the second compound. 

The term "chimeric" hedgehog therapeutic is a generic term referring to 
constructs X-A, where "X" is a polypeptide having the amino acid sequence or portion 

20 thereof, consisting of Ihe amino acid sequence of a hedgehog protein and "A" is at least 
part of a polypeptide other than hedgehog. "A" may include a linker sequence (as 
defined below) and may be attached to either, or both, of the N- or C-terminii of the 
hedgehog moiety. Chimeric hedgehog tiierapeutics of the invention therefore include 
compounds in which the various moieties are chemically cross-linked or covalently 

25 "fused" (as defined below). 

As used herein, the term "covalently coupled" means that the specified moieties 
of the hedgehog ttierapeutic are either directly covalently bonded to one another, or else 
are indirectly covalently joined to one another through an intervening moiety or 
moieties, such as a bridge, spacer, or linkage moiety or moieties. The int^ening moiety 
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or moieties are called a "coupling group". The term "conjugated" is used 
interchangeably with "covalently coupled". 

"expression control sequence"- a sequence of polynucleotides that controls and 
regulates egression of genes when operatively linked to those genes. 
5 "expression vector^'- a polynucleotide, such as a DNA plasmid or phage (among 

other common examples) which allows expression of at least one gene when the 
expression vector is introduced into a host cell. The vector may, or may not, be able to 
replicate in a cell. 

The phrase "extracellular signaling protein" means any protein that is either 
10 secreted from a cell, or is associated with the cell membrane, and upon binding to the 
receptor for that protein on a target cell, triggers a response in the target cell. 

"functional equivalent*' of an amino acid residue is (i) an amino acid having 
similar reactive prop^es as the amino acid residue that was replaced by the functional 
equivalent; (ii) an amino acid of a ligand of a polypeptide of the mvCTtion, the amino 
15 acid having similar properties as the amino acid residue that was replaced by the 
functional equivalent; (iii) a non-amino acid molecule having similar properties as the 
amino acid residue tiiat was replaced by the functional equivalent. 

A first polynucleotide encoding hedgehog protein is "functionally equivalent" 
compared with a second polynucleotide encoding hedgehog protein if it satisfies at least 
20 one of the following conditions: 

(a) the "functional equivalent" is a first polynucleotide that hybridizes to the 
second polynucleotide under standard hybridization conditions and/or is degenerate to 
the first polynucleotide sequence. Most preferably, it encodes a mutant hedgehog having 
the activity of an hedgehog therapeutic; 
25 (b) the Afunctional equivalent" is a first polynucleotide that codes on expression 

for an amino acid sequence encoded by the second polynucleotide. 

The term **hedgehog therapeutic" includes, but is not limited to, the agonist 
and/or antagonist agents listed herein as well as their functional equivalents. As used 
herein, the term "functional equivalent" therefore refers to, for example, an hedgehog 
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protein or a polynucleotide encoding the hedgehog protein that has the same or an 
improved beneficial effect on the mammalian recipient as the hedgehog of which it is 
deemed a functional equivalent. As will be appreciated by one of ordinary skill in the 
art, a fimctionally equivalent protein can be produced by recombinant techniques, e.g., 
by expressing a 'functionally equivalent DNA". Accordingly, the instant invention 
embraces hedgehog therapeutics encoded by naturally-occurring DNAs, as well as by 
non-naturally-occumng DNAs vdiich encode the same protein as encoded by the 
naturally-occurring DNA. Due to the degeneracy of the nucleotide coding sequences, 
other polynucleotides may be used to encode hedgehog protein. These include all, or 
portions of the above sequences which are altered by the substitution of different codons 
that encode the same amino acid residue within the sequence, thus producing a silent 
change. Such altered sequences are regarded as equivalents of these sequences. For 
example, Phe (F) is coded for by two codons, TTC or TTT, Tyr (Y) is coded for by TAG 
or TAT and His (H) is coded for by CAC or CAT. On the other hand, Trp (W) is coded 
for by a single codon, TGG. Accordingly, it will be appreciated that for a given DNA 
sequence encoding a particular hedgehog there will be many DNA degenerate sequences 
that will code for it. These degenerate DNA sequences are considered within the scope 
of this invention. 

The term ^fusion" or 'fusion protein" is a species of chimeric hedgehog 
therapeutic and refers to a co-linear, covalent link^e of two or more proteins or 
firagments thereof via their individual peptide backbones, most preferably through 
genetic egression of a polynucleotide molecule encoding those proteins. It is preferred 
that the proteins or fragments thereof are jfrom different soiirces (e.g., a 'chimeric' 
protein). Thus, preferred fusion therapeutics include an hedgehog protein or jfiragment 
covalentiy linked to a second moiety that is not a hedgehog protein. In certain 
embodiments, the non-hedgehog moiety may be a protein having a domain or region 
which is homologous to a member of the immunoglobulin gene superfamily. Members 
of this superfamily inlcude class I and class n major histocompatability antigens, CD4 
and T cell receptor chains. Further examples of members of this family and fusion 
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proteins containing them are found in US 5,565,335 (Genentech), incorporated herein 
by reference. 

Non-hedgehog proteins of this type are useful if they contain one or more amino 
acid sequences at least 20, 50, 75 or 150 residues in length, that are at least 40% 
homologous to a sequence of an immunoglobulin constant or variable region. A non- 
hedgehog protein meeting these requirements is said to possess an "Ig-like domain" 
which may be an 'Ig-like constant domain" or an 'Ig-like variable domain". Thus, one 
embodiment of the present invention is a chimeric hedgehog therapeutic in which the 
non-hedgehog moiety contains at least one Ig-like domain, or portion thereof. 

Other embodiments are possible. Specifically, a **hedgehog/Ig fusion" is a 
hedgehog therapeutic comprising a biologically active hedgehog molecule of the 
invention (i.e.. Sonic hedgehog), or a biologically active fragment thereof (i.e., the N- 
terminal portion) linked to an N-terminus of an immunoglobulin chain wherein a portion 
of the N-terminus of the immunoglobulin is replaced with the hedgehog. A species of 
hedgehog/Ig fusion is an 'hedgehog /Fc fusion" which is a protein comprising an 
hedgehog molecule of the invention (i.e., hedgehog -) linked to at least a part of the 
constant domain of an immunoglobulin. Also, the term "fusion protein" means an 
hedgehog protein chemically linked via a mono- or hetero- functional molecule to a 
second moiety that is not an hedgehog protein and is made de novo from purified 
protein as described below. Thus, this invention features a hedgehog therapeutic 
moleciile which includes: (1) a hedgehog moiety, (2) a second peptide, e.g., one which 
increases solubiUty or in vivo life time of the hedgehog moiety, e.g., a member of the 
immunoglobulin super family or firagment or portion thereof, e.g., a portion or a 
firagment of IgG, e.g., the human IgGl heavy chain constant region, e.g., CH2, CHS, and 
hinge regions; and a toxin moiety. 

"Heterologous promoter"- as used herein is a promoter which is not naturally 
associated with a gene or a purified nucleic acid. 

"Homology" and "identity" each refer to sequence similarity between two 
polypeptide sequmces, and both 'homology and 'identity* are used interchangeably in 
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this disclosure. Homology can be determined by comparing a position in each sequence 
which may be aligned for purposes of comparison. When a position in the compared 
sequence is occupied by the same amino acid residue, then the polypq)tides can be 
referred to as identical at that position; when the equivalent site is occupied by the same 
amino acid (e.g., identical) or a similar amino acid (e.g., similar in steric and/or 
electronic nature), then the molecules can be referred to as homologous at that position. 
A percentage of homology between sequences is a function of the number of matching 
or homologous positions shared by the sequences. An "unrelated" or "non-homologous" 
sequence shares less than 40 percent identity, though prefembly less than 25 percent 
identity, with a sequence of the present mvention. 

For instance, if 6 of 10 of the positions in two sequences are matched or are 
homologous, then the two sequences are 60% homologous. By way of example, the 
DNA sequences CTGACT and CAGGTT share 50% homology (3 of the 6 total 
positions are matched). Generally, a comparison is made when two sequences are 
aligned to give maximum homology. Such aligmnent can be provided using, for 
instance, the method of Needleman et al., J. Mol Biol. 48: 443-453 (1970), implemented 
conveniently by computer programs described in more detail below. Homologous 
sequences share identical or similar amino acid residues, where similar residues are 
conservative substitutions for, or "allowed point mutations" of, corresponding amino 
acid residues in an aligned reference sequence. In this regard, a "conservative 
substitution" of a residue in a reference sequence are those substitutions that are 
physically or functionally similar to the corresponding reference residues, e.g., that have 
a similar size, shape, electric charge, chemical properties, including the ability to form 
covalent or hydrogen bonds, or the like. Particularly preferred conservative substitutions 
are those fulfilling the criteria defined for an "accepted point mutation" in Da;^off et 
al., 5: Atlas of Protein Sequence and Structure, 5: Suppl. 3, chapter 22: 354-352, Nat 
Biomed. Res. Foundation, Washington, D.C. (1978). 

"Percent homology/identity" of two amino acids sequences or two nucleic acid 
sequences is determined using the alignment algorithm of Karlin and Altschul (Proc. 
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Nat Acad. Sci., USA 87: 2264 (1990) as modified in Karlin and Altschul (Proc. Nat 
Acad. Sci., USA 90: 5873 (1993). Such an algorithm is incorporated into the NBLAST 
or XBLAST programs of Altschul et al., J. Mol. Biol. 215: 403 (1990). BLAST searches 
are performed with the NBLAST progi:am, score = 100, wordlength = 12, to obtain 

5 nucleotide sequences homologous to a nucleic acid of tiie invention. BLAST protein 
searches are performed with the XBLAST program, score = 50, wordlength = 3, to 
obtain amino acid sequences homologous to a reference polypeptide. To obtain gapped 
aligmnents for comparisons, gapped BLAST is used as described in Altschul et al.. 
Nucleic Acids Res., 25: 3389 (1997). When using BLAST and Gapped BLAST, the 

10 default parameters of tiie respective progranos (XBLAST and NBLAST) are used. See 
ht^://www^ncbi.nlm.nih.gov. 

The term "hedgehog N-tenninal fragment" may be used interchangeably with 
"Hedgehog" and refers to the active mature sequence that is proteolytically cleaved from 
the hedgehog precursor. 

15 The temi "hydrophobic" refers to the tendency of chemical moieties with 

nonpolar atoms to interact with each other rather than water or other polar atoms. 
Materials that are "hydrophobic" are, for flie most part, insoluble in water. Natural 
products with hydrophobic properties include lipids, fatty acids, phospholipids, 
sphingolipids, acylglycerols, waxes, sterols, steroids, terpenes, prostaglandins, 

20 fliromboxanes, leukotrienes, isoprenoids, retenoids, biotin, and hydrophobic amino acids 
such as tryptophan, phenylalanine, isoleucine, leucine, valine, methionine, alanine, 
proline, and tyrosine. A chemical moiety is also hydrophobic or has hydrophobic 
properties if its physical properties are determined by the presence of nonpolar atoms. 

The phrase "internal amino acid" means any amino acid in a peptide sequence 

25 that is neither the N-terminal amino acid nor the C-terminal amino acid. 

"Isolated" (used interchangeably with "substantially pure") when applied to 
nucleic acid i.e., polynucleotide sequences that encode polypeptides, means an RNA or 
DNA polynucleotide, portion of genomic polynucleotide, cDNA or synthetic 
polynucleotide vsdiich, by virtue of its origin or manipulation: (i) is not associated with 
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all of a polynucleotide with which it is associated in nature (e.g., is present in a host cell 
as an expression vector, or a portion thereof); or (ii) is linked to a nucleic acid or other 
chemical moiety other than that to which it is linked in nature; or (iii) does not occur in 
nature. By "isolated" it is further meant a polynucleotide sequence that is: (i) amplified 
5 in vitro by, for example, polymerase chain reaction (PGR); (ii) synthesized chemically; 
(iii) produced recombinantly by cloning; or (iv) purified, as by cleavage and gel 
separation. 

"Isolated" (used interchangeably with "substantially pure") when applied to 
polypeptides means a polypeptide or a portion thereof which, by virtue of its origin or 

10 manipulation: (i) is present in a host cell as the expression product of a portion of an 
expression vector; or (ii) is linked to a protein or other chemical moiety other than that 
to which it is linked in nature; or (iii) does not occur in nature, for example, a protein 
that is chemically manipulated by appending, or adding at least one hydrophobic moiety 
to the protein so that the protein is in a form not found in nature. By "isolated" it is 

15 further meant a protein that is: (i) synthesized chemically; or (ii) expressed in a host cell 
and purified away &om associated and contaminating proteins. The term generally 
means a polypeptide that has been separated frdm other proteins and nucleic acids with 
which it naturally occurs. Preferably, the polypeptide is also separated from substances 
such as antibodies or gel matrices (polyacrylamide) v^ch are used to purify it. 

20 "multivalent protein complex" refers to a plurality of hedgehog ther^eutics (i.e., 

one or more). 

"mutant" is any change in the genetic material of an organism, in particular any 
change (i.e., deletion, substitution, addition, or alteration) in a wild type polynucleotide 
sequence or any change in a wild type protein. The term "mutein" is used 
25 intCTchangeably with "mutant". 

"N-terminal end" refers to tiie first amino acid residue (amino acid number 1) of 
the mature form of a protein. 

"N-tenninal cysteine" refers to the amino acid number 1 as shown in SEQ ID 
NOS. 23-26. In certain embodiments of the hedgehog therapeutic, the N-terminal 
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cysteine has been "modified". The term "modified" in this regard refers to chemical 
modifications of the N-terminal cysteine such as linkage thereof to another moiety such 
as a hydrophobic group and/or replacement of the N-terminal cysteine with another 
moiety, such as a hydrophobic group. 

5 "opemtively linked": A polynucleotide sequence (DNA, KNA) is operatively 

linked to an expression control sequence when the expression control sequence controls 
and regulates the transcription and translation of that polynucleotide sequence. The term 
"operatively linked" includes having an appropriate start signal (e.g., ATG) in firont of 
the polynucleotide sequence to be expressed, and maintaining the correct reading frame 

10 to permit e?q)ression of the polynucleotide sequence under the control of the e>q>ression 
control sequence, and production of the desired polypeptide encoded by the 
polynucleotide sequence. 

"protein" is any polymer consisting essentially of any of the 20 amino acids. 
Although "polypeptide" is often used in reference to relatively large polypeptides, and 

15 "peptide" is often used in reference to small polypeptides, usage of these terms in the art 
overlaps and is varied. The term "protein" as used herein refers to peptides, proteins and 
polypeptides, unless otherwise noted. 

The terms "peptide(s)", "protein(s)" and "polypeptide(s)" ' are used 
interchangeably herein. The terms "polynucleotide sequence" and "nucleotide sequence" 

20 are also used interchangeably herein. 

"Recombinant," as used herein, means that a protein is derived from 
recombinant, mammalian expression systems. Since hedgehog is not glycosylated nor 
contains disulfide bonds, it can be expressed in most prokaryotic and eukaryotic 
expression systems. 

25 "Spacer" sequence refers to a moiety that may be inserted between an amino acid 

to be modified with an antibody homolog or fragment and the remainder of tiie protein. 
A spacer is designed to provide separation between the modification and the rest of the 
protein so as to prevent the modification from interfering with protein function and/or 
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make it easier for the modification to link with an antibody homolog moiety or any other 
moiety. 

Thus, ''substantially pure nucleic acid" is a nucleic acid which is not immediately 
contiguous with one or both of the coding sequences with vMch it is normally 

5 contiguous in the naturally occurring genome of the organism from which the nucleic 
acid is derived. Substantially pure DNA also includes a recombinant DNA which is part 
of a hybrid gene encoding additional hedgehog sequences. 

The phrase "surface amino acid" means any amino acid that is e>q)osed to 
solvent when a protein is folded in its native form. 

10 "standard hybridization conditions" refer to salt and temperature conditions 

substantially equivalent to 0.5 X SSC to about 5 X SSC and 65°C for both hybridization 
and wash. The term "standard hybridization conditions" as used herein is therefore an 
operational definition and encompasses a range of hybridization conditions. 
Nevertheless, for the purposes of this present disclosure "high stringency" conditions 

15 include hybridizing with plaque screen buffer (0.2% polyvinylpyrrolidone, 0.2% FicoU 
400; 0.2% bovine serum albumin, 50 mM Tris-HCl (pH 7.5); 1 M NaCl; 0.1% sodium 
pyrophosphate; 1% SDS); 10% d^ctran sulfate, and 100 ug/ml denatured, sonicated 
salmon sperm DNA at 65 ° C for 12-20 hours, and washing with 75 mM NaCl/7.5 mM 
sodium citrate (0.5 x SSC)/1% SDS at 65°C. "Low stringency" conditions include 

20 hybridizing vnUx plaque screen buffer, 10% dextran sulfate and 110 ug/ml denatured, 
sonicated salmon sperm DNA at 55°C for 12-20 hours, and washing with 300 mM 
NaCl/30mM sodium citrate (2.0 X SSC)/1% SDS at 55°C. See also Current Protocols in 
Molecular Biology, John Wiley & Sons, Inc. New York, Sections 6.3.1-6.3.6, (1989). 

A "therapeutic composition" as used herein is defined as comprising the 

25 therapeutics of the invention and other biologically compatible ingredients. The 
therapeutic composition may contain excipients such as water, minerals and carriers 
such as protein. 
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"wild type" - the naturally-occurring polynucleotide sequence of an exon of a 
protein, or a portion thereof, or protein sequence, or portion thereof, respectively, as it 
normally exists in vivo. 

Practice of the present invention will employ, unless indicated otherwise, 
5 conventional techniques of cell biology, cell culture, molecular biology, microbiology, 
recombinant DNA, protein chemistry, and immunology, which are within the skill of the 
art. Such techniques are described in the literature. Unless stipulated otherwise, all 
references cited in the Detailed Description are incorporated herein by reference. 

10 n. General Properties of Isolated Hedgehog Proteins 

Hedgehogs are a family of genes which begin expression early in development 
and are involved in the morphogenesis of a number of organs in the developing embryo 
(Ingham, 1995, Perrimon, 1995; Johnson and Tabin, 1995; Hammerschniidt et al., 

1997) . 

15 However, there is currentiy no evidence that hedgehogs are directiy involved in 

the development of the mammalian vasculature. Knockouts of each of the mammalian 
hedgehog genes, sonic (Chiang et al., 1996; Litingtung et al., 1998; St- Jacques et al., 

1998) , Indian (St-Jacques et al., 1999; Karp et al., 2000) and desert (Bitgood et al., 
1996; Parmantier et al., 1999) hedgehog have not been reported to have defects in 

20 vascular development, but do show defects in tissues where they are known to function 
in development 

The adult functions of tiie hedgehog proteins are not well understood. Hedgehog 
is known to be expressed in adult bone/cartilage, central and peripheral nervous system, 
kidney, eye and several other tissues (Valentine et al., 1997; Traififort et al., 1998 and 
25 1999; Iwamoto et al., 1999; Jensen et al., 1997; Parmantier et al., 1999). The adult 
function of the hedgehog pathway is perhaps best understood in bone and cartilage 
where it regulates the differentiation of chondrocytes by modulating PTHrp (Iwamoto et 
al., 1999; Kazp et al., 2000). Administration of hedgehog locally in the skin also can 
induce hair growth in adult animals (Sato et al., 1999; Wang et al., 2000). 
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The various naturaUy-oc^iimng hedgehog proteins from which the subject 
therapeutics can be derived are characterized by a signal peptide, a highly conserved N*- 
terminal region (see Figure 1), and a more divergent C-tenninal domain. In addition to 
signal sequence cleavage in the secretory pathway (Lee, JJ. et al. (1992) Cell 71:33-50; 

5 Tabata, T. et al. (1992) Genes Dev. 2635-2645; Chang, D.E. et al. (1994) Development 
120:3339-3353), hedgehog precursor proteins naturally undergo an internal 
autoproteolytic cleavage which depends on conserved sequences in the C-terminal 
portion (Lee et al. (1994) Science 266:1528-1537; Porter et al. (1995) Nature 374:363- 
366). This autocleavage leads to a 19 kD N-terminal peptide and a C-terminal peptide of 

10 26-28 kD. The N-tenninal peptide stays tightly associated with the surface of cells in 
which it was synthesized, while flie C-terminal peptide is freely diffusible both in vitro 
and in vivo. Cell surface retention of the N-terminal peptide is dependent on 
autocleavage, as a truncated form of hedgehog encoded by an RNA which terminates 
precisely at the normal position of internal cleavage is difRisible in vitro (Porter et al. 

15 (1995) supra) and in vivo (Porter, J.A. et al, (1996) Cell 86, 21-34). Biochemical studies 
have shown that the autoproteolytic cleavage of the hedgehog precursor protein 
proceeds through an internal thioester intermediate, which subsequently is cleaved in a 
nucleophilic substitution. It is suggested that the nucleophile is a small lipophilic 
molecule, more particularly cholesterol, which becomes covalently boimd to the C- 

20 terminal end of the N-peptide (Porter et al (1996) sixprd)^ tethering it to the cell surface. 

The vertebrate family of hedgehog genes includes at least four members, e.g., 
paralogs of the single drosophila hedgehog gene (reference). Three of these members, 
herein referred to as Desert hedgehog (Dhh), Sonic hedgehog (Shh) and Indian 
hedgehog (Ihh), apparently exist in all vertebrates, including fish, birds, and mammals. 

25 A fourth member, herein referred to as tiggie-winlde hedgehog (Thh), appears specific 
to fish. Isolated hedgehog proteins used in the methods of this invention are naturally 
occxirring or recombinant proteins of the hedgehog family and may be obtainable from 
either invertebrate or from vertebrate sources (see references below). Members of the 
vertebrate hedgehog protein femily share homology with proteins encoded by the 
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Drosophila hedgehog (hh) gene (Mohler and Vani, (1992) Development 115, 957-971). 
Other members continue to be identified. 

Mouse and chicken Shh and mouse Ihh genes (see, for example, U.S. Patent 
5,789,543) encode glycoproteins vMch undergo cleavage, yielding an amino terminal 
fragment of about 20kDa and a carboxy terminal fragment of about 25kDa. The most 
preferred 20kDa fragment has the consensus sequence SEQ ID NO: 26 which includes 
ihe amino acid sequences of SEQ ID NOS: 23-25. Various other fragments that 
encompass the 20kDa moiety are considered within the presently claimed invention. 
Publications disclosing these sequences, as well as their chemical and physical 
properties, include Hall et al., (1995) Nature 378, 212-216; EWcer et al., (1995) Current 
Biology 5, 944-955; Fan et al., (1995) Cell 81, 457-465, Chang et al., (1994) 
Development 120, 3339-3353; Echelard et al., (1993) Cell 75, 1414-1430 34-38; PCX 
Patent AppUcation WO 95/23223 (Jessell, Dodd, Roelmk and Edlund); PCX Patent 
Publication WO 95/18856 (Ingham, McMahon and Xabin). U.S. Patent 5,759,811 lists 
the Genbank accession nimibers of a complete mRNA sequence encoding human Sonic 
hedgehog; a partial sequence of human Indian hedgehog mRNA, 5* end; and a partial 
sequence of human Desert hedgehog mRNA. Xhe hedgehog therapeutic compositions of 
the subject method can be generated by any of a variety of techniques, including 
purification of naturally occurring proteins, recombinantly produced proteins and 
synthetic chemistry. Polypeptide forms of the hedgehog therapeutics are preferably 
derived from vertebrate hedgehog proteins, e.g., have sequences corresponding to 
naturally occurring hedgehog proteins, or fragments tiiereof, from vertebrate organisms. 
However, it will be appreciated that the hedgehog polypeptide can correspond to a 
hedgehog protein (or fragment thereof) which occurs in any metazoan organism. 

Xhe vertebrate family of hedgehog genes includes at least four members, e.g., 
paralogs of the single drosophila hedgehog gene (SEQ ID No. 19). Xhree of these 
members, hereia referred to as Desert hedgehog (Dhh), Sonic hedgehog (Shh) and 
Indian hedgehog (Ihh), apparentiy exist in all vertebrates, including fish, birds, and 
mammals. A fourth member, herein referred to as tiggie-winkle hedgehog (Xhh), 
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appears specific to fish. According to the appended sequence listing, (see also Table 1) a 
chicken Shh polypeptide is encoded by SEQ ID No: 1; a mouse Dhh polypeptide is 
encoded by SEQ ID No:2; a mouse Ihh polypeptide is encoded by SEQ ID No:3; a 
mouse Shh polypeptide is encoded by SEQ ID No:4 a zebrafish Shh polypeptide is 
encoded by SEQ ID No:5; a human Shh polypeptide is encoded by SEQ ID No:6; a 
human Ihh polypeptide is encoded by SEQ ID No:7; a human Dhh polypeptide is 
encoded by SEQ ID No. 8; and a zebrafish Thh is encoded by SEQ ID No. 9. 

Table 1 



Guide to hedgehog sequences in Sequence Listing 





Nucleotide 


Amino Acid 


Chicken Shh 


SEQ ID No. 1 


SEQ ID No. 10 


Mouse Dhh 


SEQ ID No. 2 


SEQ ID No. 11 


Mooiselhh 


SEQ ID No. 3 


SEQ ID No. 12 


Moiise Shh 


SEQ ID No. 4 


SEQ ID No. 13 


Zebrafish Shh 


SEQ ID No. 5 


SEQ ID No. 14 


Human Shh 


SEQ ID No. 6 


SEQ ID No. 15 


Human Ihh 


SEQ ID No. 7 


SEQ ID No. 16 


Human Dhh 


SEQ ID No. 8 


SEQ ID No. 17 


zebrafish Thh 


SEQ ID No. 9 


SEQ ID No. 18 


DrosophilaHH 


SEQ ID No. 19 


SEQ ID No. 20 



In addition to the sequence variation between the various hedgehog homologs, 
the hedgehog proteins are apparently present naturally in a niimber of different forms, 
including a pro-form, a fiiU-length mature form, and several processed firagments 
thereof. The pro-form includes an N-terminal signal peptide for directed secretion of the 
extracellular domain, while the fiill-length mature form lacks this signal sequence. 

As described above, fiirther processing of the mature form occurs in some 
instances to yield biologically active fragments of the protein. For instance, sonic 
hedgehog undergoes additional proteolytic processing to yield two peptides of 
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approximately 19 kDa and 27 kDa, the 19kDa fragment corresponding to an proteolj^c 
N-terminal portion of the mature protein. 

In addition to proteolytic firagmentadon, the vertebrate hedgehog proteins can 
also be modified post-translationally, such as by glycosylation and/or addition of 
5 Upophilic moieties, such as stents, fatty acids, etc., though bacterially produced (e.g. 
unmodified) forms of the proteins still maintain certain of the bioactivities of the native 
protein. Bioactive firagments of hedgehog polypeptides of the present invention have 
been generated and are described in great detail in, e.g., PCX publications WO 95/18856 
and WO 96/17924. 

10 A "hedgehog then^eutic" of the invention is defined in t^ms of having at least a 

portion that consists of the consensus amino acid sequence of SEQ ID NO: 26 or at least 
a portion that consists of SEQ ID NOS: 10-18 or 23-25. The term also means a 
hedgehog polypeptide, or a fimctional variant of a hedgehog polypeptide, or homolog of 
a hedgehog polypeptide, or functional variant, which has biological activity and can 

IS modulate angiogenesis. 

Members useful in the methods of the invention include any of the naturally- 
occiuring native hedgehog proteins including allelic, phylogenetic counterparts or other 
variants thereof, whether naturally-sourced or produced chemically including muteins or 
mutant proteins, as well as recombmant forms and new, active members of the 

20 hedgehog family. Particularly usefiil hedgehog polypeptides have portions that include 
all or part of SEQ ID NOS: 23-26. 

Hedgehog thempeutics may also include polypeptides having an amino acid 
sequence at least 60%, 80%, 90%, 95%, 98%, or 99% homologous to an amino acid 
sequence from SEQ ID NOS 10-18 or 23-26. The polypeptide can also include an amino 

25 acid sequence essentially the same as an amino acid sequence in SEQ ID NOS: 10-18 or 
23-26. The polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length and 
includes at least 5, preferably at least 10, more preferably at least 20, most preferably at 
least 50, 100, or 150 contiguous amino acids from SEQ ID NOS: 10-18 or 23-26. 
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Polypeptides of tibe invention include those which arise as a result of the 
existence of multiple genes, alternative transcription events, alternative RNA splicing 
events, and alternative translational and posttranslational events. The polypeptide can be 
made entirely by synthetic means or can be expressed in systratis, e.g., cultured cells, 
which result in substantially the same posttranslational modifications present when the 
protein is expressed in a native cell, or in systems which result in the omission of 
posttranslational modifications present when expressed in a native cell. 

Moreover, mutagenesis can be used to create modified hh polypeptides, e.g., for 
such purposes as enhancing therapeutic or prophylactic efficacy, or stability (e.g., 
ex vivo shelf life and resistance to proteolytic degmdation in vivo). Such modified 
peptides can be produced, for instance, by amino acid substitution, deletion, or addition. 
Modified hedgehog polypeptides can also include those with altered post-translational 
processing relative to a naturally occurring hedgehog protein, e.g., altered glycosylation, 
cholesterolization, prenylation and the like. 

Li one embodiment, a hedgehog therapeutic is a hedgehog polypq)tide with one 
or more of the following characteristics: 

(i) it has at least 30, 40, 42, 50, 60, 70, 80, 90 or 95% sequence identity with 

amino acids of SEQ ID NOS: 23-26; 

- (ii) it has a cysteine or a fimctional equivalent as the N-terminal end; 

(iii) it may induce alkaline phosphatase activity in C3H10T1/2 cells; 

(iv) it has an overall sequence identity of at least 50%, preferably at least 60%, 
more preferably at least 70, 80, 90, or 95%, with a polypeptide of SEQ ID NOS: 
10-18; 

(v) it can be isolated fi:om natural sources such as mammalian cells; 

(vi) it can bind or interact with patched; and 

(vii) it may be modified at af least one amino acid residue by a polyalkylene 
glycol polymer attached to the residue or, optionally, via a linker molecule to the 
amino acid residue. 
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Preferred nucleic acids encode a polypeptide comprising an anuno acid sequence 
at least 60% homologous or identical, more preferably 70% homologous or identicad, 
and most preferably 80% homologous or identical with an amino acid sequence selected 
fix}m the groiq) consisting of SEQ ID NOS: 10-18 or 23-26. Nucleic acids which encode 
5 polypeptides at least about 90%, more prefCTably at least about 95%, and most 
prefembly at least about 98-99% homology or identity with an anaino acid sequence 
represented in one of SEQ ID Nos: 23-26 are also within the scope of the invention. 

In another embodiment, the hedgehog therapeutic is a polypeptide encodable by 
a nucleotide sequence that hybridizes xrnder stringent conditions to a hedgehog coding 
10 sequence represented in one or more of SEQ ID NOS: 1-9, 19 or 23-26. 

Preferred nucleic acids encode a hedgehog polypeptide comprising an amino 
acid sequence at least 60% homologous, more preferably 70% homologous and most 
preferably 80% homologous with an amino acid sequence selected from the group 
consisting of SEQ ID Nos:8-14. Nucleic acids which encode polypeptides at least about 
15 90%, more preferably at least about 95%, and most preferably at least about 98-99% 
homology with an amino acid sequence represented in one of SEQ ID Nos: 10-18 or 20 
are also within the scope of the invention. 

Hedgehog therapeutics, in addition to native hedgehog proteins, are at least 60% 
homologous, more preferably 70% homologous and most preferably 80% homologous 
20 Avith an amino acid sequence represented by any of SEQ ID Nos: 10-18 or 20. 
Polypeptides which are at least 90%, more preferably at least 95%, and most preferably 
at least about 98-99% homologous with a sequence selected from the group consisting 
of SEQ ID Nos: 10-1 8 or 20 are also within the scope of the invention. 

With respect to fragmoits of hedgehog polypeptide, preferred hedgehogs 
25 moieties include at least SO amino acid residues of a hedgehog polypeptide, more 
prefembly at least 100, and even more prefembly at least 1 SO. 

Another preferred hedgehog polypeptide which can be included in the hedgehog 
therapeutic is an N-terminal fragment of the mature protein having a molecular weight 
of approximately 1 9 kDa. 
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Preferred human hedgehog proteins include N-terminal fragments corresponding 
approximately to residues 24-197 of SEQ ID No. 15, 28-202 of SEQ ID No. 16, and 23- 
198 of SEQ ID No. 17. By "corresponding approximately" it is meant that the sequence 
of interest is at most 20 amino acid residues dififerent in length to tiie reference 

5 sequence, though more preferably at most 5, 10 or 1 5 amino acid different in length. 

Still other preferred hedgehog therapeutics include an amino acid sequence 
represented by the formula A-B wherein: (i) A represents all or the portion of the amino 
acid sequence designated by residues 24-193 of SEQ ID No: 15; and B represents at 
least one amino acid residue of the amino acid sequence designated by residues 194-250 

10 of SEQ ID No: 15; (ii) A represents all or the portion of the amino acid sequence 
designated by residues 25-193 of SEQ ID No; 13; and B represents at least one amino 
acid residue of the amino acid sequence designated by residues 194-250 of SEQ ID No: 
13; (iii) A represents all or the portion of the amino acid sequence designated by 
residues 23-193 of SEQ ID No: 11; and B represents at least one amino acid residue of 

15 the amino acid sequence designated by residues 194-250 of SEQ ID No: 1 1; (iv) A 
represents all or the portion of the amino acid sequence designated by residues 28-197 
of SEQ ID No: 12; and B represents at least one amino acid residue of the amino acid 
sequence designated by residues 198-250 of SEQ ID No: 12; (v) A represents all or the 
portion of the amino acid sequence designated by residues 29-197 of SEQ ID No: 16; 

20 and B represents at least one amino acid residue of the amino acid sequence designated 
by residues 198-250 of SEQ ID No: 16; or (vi) A represents all or the portion of the 
amino acid sequence designated by residues 23-193 of SEQ ID No. 17, and B represents 
at least one amino acid residue of the amino acid sequence designated by residues 194- 
250 of SEQ ID No. 17. In certain preferred embodiments, A and B together represent a 

25 contiguous polypeptide sequence designated sequence, A represents at least 25, 50, 75, 
100, 125 or 150 amino acids of the designated sequence, and B represents at least 5, 10, 
or 20 aixdno acid residues of the amino acid sequence designated by corresponding entry 
in the sequence listing, and A and B togeth^ preferably represent a contiguous sequence 
corresponding to the sequence listing entry. Similar fragmmts from other hedgehog also 
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contemplated, e.g., fragments which corre^ond to the preferred fragments from the 
sequence listing entries which are enumerated above. 

ni. Production of Recombinant Polypeptides 

5 Isolated hedgehog polypeptides described herein can be produced by any 

suitable method known in the art. Such methods range from direct protein synthetic 
methods to constructing a DNA sequence encoding isolated polypeptide sequences and 
expressing those sequences in a suitable transformed host. 

In one embodiment of a recombinant method, a DNA sequence is constructed by 

10 isolating or synthesizing a DNA sequence encoding a wild type protein of interest. 
Optionally, the sequence may be mutagenized by site-specific mutagenesis to provide 
functional analogs thereof. See, e.g.. United States Patent 4,588,585. Another method of 
constructing a DNA seqtience encoding a polypeptide of interest would be by chemical 
synthesis using an oligonucleotide synthesizer. Such oligonucleotides may be preferably 

15 designed based on the amino acid sequence of the desired polypeptide, and preferably 
selecting those codons that are favored in tiie host cell in which tiie recombinant 
polypeptide of interest will be produced. 

Standard methods may be applied to synthesize an isolated polynucleotide 
sequence encoding an isolated polypeptide of interest. For example, a complete amino 

20 acid sequence may be used to construct a back-translated gene. See Maniatis et al., 
supra. Further, a DNA oligomer containing a nucleotide sequence coding for the 
particular isolated polypeptide may be synthesized. For example, several small 
oligonucleotides coding for portions of the desired polypeptide may be synthesized and 
then ligated. The individual oligonucleotides typically contain 5' or 3' overhangs for 

25 complementary assembly. 

Once assembled (by synthesis, site-directed mutagenesis, or by another metiiiod), 
the mutant DNA sequences encoding a particular isolated polypeptide of interest will be 
inserted into an e?qpression vector and operatively linked to an egression control 
sequence ^propriate for expression of the protein in a desired host Proper assembly 
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en2yme, the size of the polypeptide, how easily the polypeptide is proteolytically 
degraded, and the like. The choice of a vector and insertion site for a given DNA is 
determined by a balance of these factors. 

To provide for adequate transcription of tiie recombinant constructs of the 

5 invention, a suitable promoter/enhancer sequence may preferably be incorporated into 
the recombinant vector, provided that the promoter/expression control sequence is 
capable of driving transcription of a nucleotide sequence encoding a hedgehog protein. 
Any of a vsdde variety of expression control sequences may be used in these vectors. 
Such useful expression control sequences include the expression control sequences 

10 associated vdth structural genes of the foregoing expression vectors. Examples of useful 
expression control sequences include, for example, the-early and late promoters of SV40 
or adenovirus, the lac system, the trp system, the TAG or TRC system, the major 
operator and promoter regions of phage lambda, for example pL, the control regions of 
fd coat protein, the promoter for 3-phosphoglycerate kinase or other glycolytic enzjnnes, 

15 the promoters of acid phosphatase, e.g., Pho5, the promoters of the yeast alpha-mating 
system and other sequences known to control the expression of genes of prokaryotic or 
eukaryotic cells and their viruses, and various combinations thereof. 

Promoters which may be used to control the expression of immunoglobulin- 
based fusion protein include, but are not limited to, the SV40 early promoter region 

20 (Benoist and Chambon, 1981, Nature 290:304-310), the promoter contained in the 3* 
long terminal repeat of Rous sarcoma virus (Yamamoto, et al., 1980, Cell 22:787-797), 
the heipes thymidine kinase promoter (Wagnw et al., 1981, Prbc. Natl. Acad. Sci. 
U.S.A. 78:144-1445), the regulatory sequences of the metallothionine gene (Brinster et 
al., 1982, Nature 296:39-42); plant expression vectors comprising the nopaline 

25 synthetase promoter region (Herrera-Estrella et al.. Nature 303:209-213) or the 
cauliflower mosaic virus 35S RNA promoter (Gardner, et al., 1981, Nucl. Acids Res. 
9:2871), and the promoter for the photosynlhetic enzyme ribulose biphosphate 
carboxylase (Herrera-Estrella et al., 1984, Nature 310:115-120); promoter elements 
from yeast or other fungi such as the Gal 4 promoter, the ADC (alcohol dehydrogenase) 

30 promoter, PGK (phosphoglycerol kinase) promoter, alkaline phophatase promoter, and 
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the following animal transcriptional control regions, which exhibit tissue specificity and 
have been utilized in transgenic animals: elastase I gene control region which is active 
in pancreatic cells (Swift et al., 1984, Cell 38:639-646; Omitz et al., 1986, Cold Spring 
Harbor Symp. Quant Biol. 50:399-409; MacDonald, 1987, Hepatology 7:425-515); 
insulin gene enhancers or promoters which are active in pancreatic cells (Hanahan, 
1985, Nature 315:115*122); immunoglobulin gene enhancers or promoters which are 
active in lymphoid cells (Grosschedl et al., 1984, Cell 38:647-658; Adames et al., 1985, 
Nature 318:533-538; Alexander et al., 1987, Mol. Cell. Biol. 7:1436-1444); the 
cytomegalovirus early promoter and enhancer regions (Boshart et al., 1985, Cell 41 :521- 
530); moiise mammary tumor virus control region which is active in testicular, breast, 
lymphoid and mast cells (Led^ et al., 1986, Cell 45:485-495); albumin gene control 
region which is active in liver (Pinkert et al., 1987, Genes and Devel. 1:268-276); alpha- 
fetoprotein gene control region which is active in liver (Krumlauf et al., 1985, Mol. Cell. 
Biol. 5:1639-1648; Hammer et al., 1987, Science 235:53-58); alphantitrypsin gene 
control region which is active in the liver (Kelsey et al, 1987, Gtenes and Devel. 1:161- 
171); -globin gene control region which is active in myeloid cells (Mogram et al., 1985, 
Nature 315:338-340; KoUias et al., 1986, Cell 46:89-94; myelin basic protein gene 
control region which is active in oligodendrocyte cells in the brain (Readhead et al., 
1987, Cell 48:703-712); myosin light chain-2 gene control region which is active in 
skeletal muscle (Sani, 1985, Nature 314:283-286); and gonadotropic releasing hormone 
gene control region which is active, in the hypothalamus (Mason et al., 1986, Science 
234:1372-1378). 

Any suitable host may be used to produce in quantity the isolated hedgehog 
polypeptides described herein, including bacteria, fungi (including yeasts), plants, 
insects, mammals, or other appropriate animal cells or cell lines, as well as transgenic 
animals or plants. More particularly, these hosts may include well known eukaryotic and 
prokaryotic hosts, such as strains of E. coli, Pseudomonas, Bacillus, Streptomyces, 
fungi, yeast (e.g., Hansenula ), insect cells such as Spodoptera frugiperda (SF9), and 
High Five TM , animal cells such as Chinese hamster ovary (CHO), moxise cells such as 



-27- 



wo 01/98344 



PCT/USOl/19435 



NS/O cells, African green monkey cells, COS 1, COS 7, BSC 1, BSC 40, and BMT 10, 
and human cells, as well as plant cells. 

It should be understood that not all vectors and egression control sequences will 
function equally well to express a given isolated polypeptide. Neither will all hosts 

5 function equally well with the same expression system. However, one of skill in the art 
may make a selection among these vectors, expression control systems and hosts 
without undue experimentation. For example, to produce isolated polypeptide of interest 
in large-scale animal cxilture, the copy number of the expression vector must be 
controlled. Amplifiable vectors are well known in the art See, for example, Kaufinan 

10 and Sharp, (1982) MoL CeU. Biol., 2. 1304-1319 and U.S. Patents 4,470,461 and 
5,122,464. 

Such operative linking of a DNA sequence to an expression control sequence 
includes the provision of a translation start signal in the correct reading frame upstream 
of the DNA sequence. If the particular DNA sequence being expressed does not begin 

15 with a methionine, the start signed will result in an additional amino acid (methionine) 
being located at the N-terminus of the product. If a hydrophobic moiely is to be linked 
to the N-terminal methionyl-containing protein, the protein may be employed directly in 
the compositions of the invention. Neverthless, since the preferred N-tenninal end of the 
protein is to consist of a cysteine (or functional equivalent) the methionine must be 

20 removed before use. Methods are available in the art to remove such N-terminal 
methionines from polypeptides expressed with them. For example, certain hosts and 
fermentation conditions permit removal of substantially all of the N-terminal methionine 
m vivo. Other hosts require in vitro removal of the N-terminal methionine. Such in vitro 
and in vivo methods are well known in the art. 

25 Successful incorporation of these polynucleotide constructs into a given 

expression vector may be identified by three general approaches: (a) DNA-DNA 
hybridization, (b) presence or absence of "marker" gene functions, and (c) expression of 
inserted sequences. In the first approach, the presence of the hedgehog gene inserted in 
an expression vector can be detected by DNA-DNA hybridization using probes 

30 comprising sequences that are homologous to the inserted fusion protein gene. In the 
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second approach, the recomhinant vector/host system can be identified and selected 
based upon the presence or absence of certain "marker" gene functions (e.g., thymidine 
kinase activity, resistance to antibiotics such as G418, transformation phenotype, 
occlusion body formation in baculovirus, etc.) caused by the insertion of foreign genes 

5 in the vector. For example, if the polynucleotide is inserted so as to intermpt a marker 
gene sequence of the vector, recombinants containing the insert can be identified by the 
absence of the marker gene fimction. In the third approach, recombinant e3q)ression 
vectors can be identified by assaying the foreign gene product expressed by the 
recombinant vector. Such assays can be based, for example, on the physical or 

10 functional properties of the gene product in bioassay systems. 

Recombinant nucleic acid molecules which encode chimeric hedgehog 
therapeutics may be obtained by any method known in the art (Maniatis et al., 1982, 
Molecular Cloning; A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, N.Y.) or obtained from publicly available clones. Methods for the preparation 

15 of genes which encode the heavy or light chain constant regions of immunoglobulins are 
taught, for example, by Robinson, R- et al., PCT Application, Publication No. W087- 
02671. The cDNA sequence encoding the hedgehog molecule or flragment may be 
directly joined to the cDNA encoding the heavy Ig contant regions or may be joined via 
a linker sequence. In further embodiments of the invention, a recombinant vector system 

20 may be areated to accommodate sequences encoding hedgehog in the coirect reading 
frame with a synthetic hinge region. Additionally, it may be desirable to include, as part 
of the recombinant vector system, nucleic acids corresponding to the 3' flanking region 
of an immunoglobulin gene including RNA cleavage/polyadenylation sites and 
downstream sequences. Furthermore, it may be desirable to engineer a signal sequence 

25 upstream of the immimoglobulin fusion protein-encoding sequences to facilitate the 
secretion of the fused molecule fix>m a cell transformed with the recombinant vector. 

The proteins produced by a transformed host can be purified according to any 
suitable method. Such standard methods include chromatography (e.g., ion exchange, 
affinity, and sizing column chromatography), centrifugation, differential solubility, or 

30 by any other standard technique for protein purification. For immunoafiBnity 
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. chromatography (See Example \ a protein such as Sonic hedgehog may be isolated by 
binding it to an affinity colnnm comprising of antibodies that were raised against Sonic 
hedgehog, or a related protein and were affixed to a stationary siq>port. For example, the 
hedgehog proteins and fragments may be purified by passing a solxidon thereof tibrough 

5 a column having an hedgehog receptor immobilized thereon (see U.S.Pat. No. 
4,725,669). The bound hedgehog molecule may then be eluted by treatment with a 
chaotropic salt or by elution with aqueous acetic acid. Specific inmiunoglobulin fiision 
proteins may be purified by passing a solution containing the fusion protein through a 
column which contains immobilized protein A or protein G which selectively binds the 

10 Fc portion of the fiision protein. See, for example, Reis, K. J., et al., J. Immunol. 
132:3098-3102 (1984); PCT Application, PubUcation No. W087/00329. 

Alternatively hedgehog proteins and chimeric molecules may be purified on 
anti-hedgehog antibody columns, or on anti-immunoglobulin antibody columns to give 

15 a substantially pure protein. By the term "substantially pure" is intended that the protein 
is firee of the impurities that are naturally associated therewith. Substantial purity may be 
evidenced by a single band by electrophoresis. Alternatively, affinity tags such as 
hexahistidine, maltose binding domain, influ^iza coat sequence, and glutathione-S- 
transferase can be attached to the protein to allow easy purification by passage over an 

20 appropriate afiSnity column. Isolated proteins can also be characterized physically using 
such techniques as proteolysis, nuclear magnetic resonance, and X-ray crystallography. 

An example of a usefiil hedgehog/Ig chimeric protein of this invention is that 
protein encoded by the nucleotide sequence of SEQ ID NOS: 31-34, which are secreted 
into the cell culture by eukaryotic cells containing the expression plasmids pUB55, pUB 

25 114, pUB 115 and pUB 116, respectively (See Examples). These proteins consist of the 
mature hunmn hedgehog fiised to a portion of the hinge region and the CH2 and CH3 
constant domains of murine or human Ig. Proteins of this group contains a sufficient 
portion of the immimoglobulin to be recognized by the Fc binding protein. Protein A. 

30 
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A. Production of Fragments and Analogs 

Fragments of an isolated protein (e.g., fragments of SEQ ID NOS: 10-18 or 23- 
26) can also be produced efficiently by recombinant methods, by proteolytic digestion, 
or by chemical synthesis using methods known to fiiose of skill in the art In 

5 recombinant methods, internal or terminal fragments of a polypeptide can be generated 
by removing one or more nucleotides from one end (for a temunal fragment) or both 
ends (for an internal fragment) of a DNA sequence which encodes for the isolated 
hedgehog polypeptide. Expression of the mutagenized DNA produces polypeptide 
fragments. Digestion with "end nibbling" endonucleases can also generate DNAs which 

10 encode an array of fragments. DNAs which encode fragments of a protein can also be 
generated by random shearing, restriction digestion, or a combination of both. Protein 
fragments can be generated directly from intact proteins. Peptides can be cleaved 
specifically by proteolytic enzymes, including, but not limited to plasmin, thrombin, 
trypsin, chymotrypsin, or pepsin. Each of these enzymes is specific for the type of 

15 peptide bond it attacks. Trypsin catalyzes the hydrolysis of peptide bonds in which the 
carbonyl group is from a basic amino acid, usually arginine or lysine. Pepsin and 
chymotrypsin catalyse the hydrolysis of peptide bonds from aromatic amino acids, such 
as tryptophan, tyrosine, and phenylalanine. Alternative sets of cleaved protein fragments 
are generated by preventing cleavage at a site which is susceptible to a proteolytic 

20 enzyme. For instance, reaction of the E-amino acid group of lysine with 
ethyltrifluorothioacetate in mUdly basic solution yields blocked amino acid residues 
whose adjacent peptide bond is no longer susceptible to hydrolysis by trypsin. Proteins 
can be modified to create peptide linkages that are susceptible to proteolytic enzymes. 
For instance, alkylation of cysteine residues with (3-haloethylamines yields peptide 

25 linkages that are hydrolyzed by trypsin (Lindley, (1956) Nature 178, 647). In addition, 
chemical reagents that cleave peptide chains at specific residues can be used. For 
example, cyanogen bromide cleaves peptides at methionine residues (Gross and Witkip, 
(1961) J. Am. Chem. Soc. 83, 1510). Thus, by treating proteins with various 
combinations of modifiers, proteolytic enzymes and/or chemical reagents, the proteins 

-31- 



wo 01/98344 



PCTAJSOl/19435 



may be divided into firagments of a desired length with no overlap of the fragments, or 
divided into overlapping fragments of a desired length 

Fragments can also be synthesized chemically using techniques known in the art 
such as the Merrifield solid phase F moc or t-Boc chemistry. Men±5eld, Recent 
5 Progress in Hormone Research 23: 451 (1967). 

Examples of prior art methods which allow production and testing of fragments 
and analogs are discussed below. These, or analogous methods may be used to make 
and screen fragments and analogs of an isolated polypeptide (e.g., hedgehog) which can 
be shown to have biological activity. An exemplary method to test whether fragments 
10 and analogs of hedgehog have biological activity is found in Example - 

B. Production of Altered DNA and Peptide Sequences: Random Methods 
Amino acid sequence variants of a protein can be prepared by random 

mutagenesis of DNA which encodes the protein or a particular portion thereof. Useftd 
15 methods include PGR mutagenesis and saturation mutagenesis. A library of random 

amino acid sequence variants can also be generated by the synthesis of a set of 

degenerate oligonucleotide sequences. Methods of generating amino acid sequence 

variants of a given protein using altered DNA and peptides are well-known in the art. 

The following examples of such methods are not intended to limit the scope of the 
20 present invention, but merely serve to illustrate representative techniques. Persons 

having ordinary skill in the art will recognize that other methods are also useful in this 

regard. 

PGR Mutagenesis : See, for example Leung et al., (1989) Technique 1, 1 1-15. 
Saturation Mutagenesis : One method is described generally in Mayers et al., 
25 (1989) Science 229, 242. 

Degenerate Oligonucleotide Mutagenesis : See for example Harang, S.A., (1983) 
Tetrahedron 39, 3; Itakura et al., (1984) Ann. Rev. Biochem. 53, 323 and Itakura 
et al.. Recombinant DNA, Proc. 3rd Gleveland Symposium on Macromolecules, 
pp. 273-289 (A,G. Walton, ed.), Elsevier, Amsterdam, 1981. 

30 
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C. Production of Altered DNA and Peptide Sequences: Directed Methods 
Non-random, or directed, mutagenesis provides specific sequences or mutations 

in specific portions of a polynucleotide sequence that encodes an isolated polypeptide, 
to provide variants which include deletions, insertions, or substitutions of residues of the 
5 known amino acid sequence of the isolated polypeptide. The mutation sites may be 
modified individually or in series, for instance by: (1) substituting first wifli conserved 
amino acids and then with more radical choices depending on the results achieved; (2) 
deleting the target residue; or (3) inserting residues of the same or a different class 
adjacent to the located site, or combinations of options 1-3. 
10 Clearly^ such site-directed methods are one way in which an N-terminal cysteine 

(or a fimctional equivalent) can be introduced into a given polypeptide sequence to 
provide the attachment site for a hydrophobic moiety. 

Alanine s canninfr Mutagenesis : See Cunningham and Wells, (1989) Science 244, 

1081-1085). 

15 Oligonucleotide-Mediated Mutagenesis : See, for example, Adelman et al., 

(1983) DNA 2, 183. 

Cassette Mutagenesis : See Wells et al., (1985) Gene 34, 315. 
Combinatorial Mutagenesis : See, for example, Ladner et al., WO 88/06630 

20 Indeed, it is plain from tiie combinatorial mutagenesis art that large scale 

mutagenesis of hedgehog proteins, without any preconceived ideas of which residues 
were critical to the biological function, and generate wide armys of variants having 
equivalent biological activity. Indeed, it is the ability of combinatorial techniques to 
screen billions of different variants by high throughout analysis that removes any 

25 requirement of a priori understanding or knowledge of critical residues. 

D. Other Variants of Isolated Polypeptides 

Included in the invention are isolated molecules that are: allelic variants, natural 
mutants, induced mutants, and proteins encoded by DNA that hybridizes under hig^ or 
30 low stringency conditions to a nucleic acid which encodes a polypeptide such as tiie N- 
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tenninal fragment of Sonic hedgehog (SEQ ID NO: 23) and polypq>tides bound 
specifically by antisera to hedgehog peptides, especially by antisera to an active site or 
binding site of hedgehog. All variants described herein are expected to: (i) retain the 
biological function of the original protein and (ii) retain the ability to link to form a 

5 chimeric molecule with a non-hedgehog moiety. 

The methods of the invention also feature uses of firagments, preferably 
biologically active fragments, or analogs of an isolated peptide such as hedgehog. 
Specifically, a biologically active fragment or analog is one having any in vivo or in 
vitro activity which is characteristic of the peptide shown in SEQ ID NOS: 10-1 8 or 23- 

10 26 or of other naturally occurring isolated hedgehog. Most preferably, the 
hydrophobically-modified fragment or analog has at least 10%, preferably 40% or 
greater, or most preferably at least 90% of the activity of Sonic hedgehog in any in vivo 
or in vitro assay. 

Analogs can differ from naturally occurring isolated protein in amino acid 

15 sequence or in ways that do not involve sequence, or both. The most preferred 
polypeptides of the invention have preferred non-sequence modifications that include in 
vivo or in vitro chemical derivatization (e.g., of their N-tenninal end). Hedgehog 
polypeptides may also be chemically modified to create hedgehog derivatives by 
forming covalent or aggregate conjugates with other chemical moieties, such as glycosyl 

20 groups, cholesterol, isoprenoids, lipids, phosphate, acetyl groups and the like. Covalent 
derivatives of hedgehog proteins can be prepared by linking the chemical moieties to 
Actional groups on amino acid sidechains of the protein or at the N-tenninus or at the 
C-terminus of the polypeptide. 

For instance, hedgehog proteins can be generated to include a moiety, other than 

25 sequence naturally associated with the protein, that binds a component of the 
extracellular matrix and enhances localization of the analog to cell sur&ces. For 
example, sequences derived from the fibronectin "type-m repeat", such as a tetrapeptide 
sequence R-G-D-S (Pierschbacher et al. (1984) Nature 309:30-3; and Komblihtt et al. 
(1985) EMBO 4:1755-9) can be added to the hedgehog polypeptide to support 

30 attachment of the chimeric molecule to a cell tbroiigh binding ECM components 
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(Ruoslahti et al. (1987) Science 238:491-497; Pierschbacheret al. (1987) J. Biol Chem. 
262:17294-8.; Hynes (1987) Cell 48:549-54; and Hynes (1992) Cell 69:1 1-25). 

Other analogs include a protein such as Sonic hedgehog or its biologically active 
firagments whose sequences differ from the wild type consensus sequence (e.g., SEQ ID 
NO: 26) by one or more conservative amino acid substitutions or by one or more non 
conservative amino acid substitutions, or by deletions or insertions which do not abolish 
die isolated protein's biological activity. Conservative substitutions typically include the 
substitution of one amino acid for another with similar characteristics such as 
substitutions within the following groups: valine, alanine and glycine; leucine and 
isoleuoine; aspartic acid and glutamic acid; asparagine and glutamine; serine and 
threonine; lysine and arginine; and phenylalanine and tyrosine. The non-polar 
hydrophobic amino acids include alanine, leucine, isoleucine, valine, proline, 
phenylalanine, tryptophan, and methionine. The polar neutral amino acids include 
glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine. The positively 
charged (basic) amino acids include arginine, lysine, and histidine. The negatively 
charged (acidic) amino acids include aspartic acid and glutanuc acid. Other conservative 
substitutions can be readily known by workers of ordinary skilL For example, for the 
amino acid alanine, a conservative substitution can be taken from any one of D-alanine, 
glycine, beta-alanine, L-cysteine, and D-cysteine. For lysine, a replacement can be any 
one of D-lysine, arginine, D-arginine, homo-arginine, methionine, D-metfaionine, 
ornithine, or D-omithine. 

Other analogs used within the methods of the invention are those with 
modifications which increase peptide stability. Such analogs may contain, for example, 
one or more non-peptide bonds (which replace tiie peptide bonds) in the peptide 
sequence. Also included are: analogs that include residues other than naturally occurring 
L-amino acids, such as D-amino acids or non-naturally occurring or syntiietic amino 
acids such as beta or ganmia amino acids and cyclic analogs. Incorporation of D- instead 
of L-amino acids into the isolated hedgehog polypeptide may increase its resistance to 
proteases. See, U.S. Patent 5,219,990 supra. The term "fragment", as q)plied to an 
isolated hedgehog analog, can be as small as a single amino acid provided that it retains 
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biological activity. It may be at least about 20 residues, more typically at least about 40 
residues, preferably at least about 60 residues in length. Fragments can be generated by 
methods known to those skilled in the art. The ability of a candidate fragment to exhibit 
isolated hedgehog biological activity can be also assessed by methods known to those 
5 skilled in the art as described herein. 

IV. Antagonists of Hedgehog Activity 

A preferred antagonist has at least the following properties: (i) the isolated 
protein binds the receptor patched-1 with an a£Qnity that may be less than, but is 

10 preferably at least the same as, the binding of mature hedgehog protein to patched-1; 
and (ii) the isolated protein blocks alkaline phosphatase (AP) induction by mature 
hedgehog protein when tested in an in vitro CH310T1/2 cell-based AP induction assay. 
Antagonists of the invention may also have the additional properties of being (iii) unable 
to induce ptc-1 and gli-1 expression. 

15 Persons having ordinary skill in the art can easily test any putative hedgehog 

antagonist for these properties. In particular, the mouse embryonic fibroblast line 
C3H10T1/2 is a mesenchymal stem cell line that is hedgehog responsive. Hedgehog 
treatment of the cells causes an upregulation of gli-1 and patched-1. (known indicators of 
hedgehog dependent signaling) and also causes induction of alkaline pho^hatase 

20 activity, an indicator that Hie cells have difG^ntiated down flie chondrocyte/ bone 
osteoblast lineage. Several hedgehog variants are imable to elicit a hedgehog-dependent 
response on C3H10T1/2 cells, but they competed with mature hedgehog for function 
and therefore serve as functional antagonists. The synthesis and use of such hedgehog 
antagonist moieties are briefly described below. 

25 

A. N-Modified Hedgehog Polypeptides as Antagonists 

Certain hedgehog variants that contain N-terminal modifications can block 
hedgehog function because they lack the ability to elicit a hedgehog-dependent response 
but retain the ability to bind to hedgehog receptor, patched-1. The critical primaiy amino 
30 acid sequence that defines whether a hedgehog polypeptide (i.e., a Sonic, Indian or 
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Desert hedgehog) is a functional hedgehog antagonist is fhe N-teiminal cysteine residue 
which corresponds to Cys-1 of the mature hedgehog. So long as the hedgehog 
polypeptide either lacks this N-tenninal cysteine completely or contains this N-tenninal 
cysteine in a modified form (e.g. chemically modified or included as part of an N- 
teraiinal extension moiety), the resulting polypeptide can act as a functional hedgehog 
antagonist. In this regard, the fact that an N-terminal cysteine "corresponds to Cys-1 " 
means: (a) tiie N-terminal cysteine is the Cys-1 of mature Sonic, Indian or Desert 
hedgehog; or (b) the N-terminal cysteine occupies the same position as Cys-1 of mature 
Sonic, Indian or Desert hedgehog. Provided that, for example, a Sonic hedgehog has an 
N-terminal cysteine corresponding to Cys-1 that is altered or otherwise modified as 
described herein, it can antagonize the action of any other member of the hedgehog 
family. Therefore, persons having ordinary skill in the art will imderstand that it is 
possible for an Indian hedgehog protein to antagonize the activity of Sonic, Desert or 
Indian hedgehogs. 

Examples of these antagonists with N-terminal modifications are included below 
and one skilled in the art can alter the disclosed structure of the antagonist, e.g., by 
producing fi^agments or analogs, and test the newly produced structures for antagonist 
activity. These examples in no way limit the structure of any related hedgehog 
antagonists, but are merely provided for fiiriher description. These, or analogous 
methods, can be used to make and screen firagments and analogs of a antagonist 
polypeptides. There are several variants that are able to function as antagonists. 

1 . N-terminal ^ctensions 

Antagonist polypeptides of the invention may include a hedgehog polypeptide 
sequence in which the N-terminal cysteuie is linked to an N-terminal extension moietjr. 
The isolated antagonist polypeptide can therefore be, as but one example, a recombinant 
fusion protein having: (a) a first N-terminal polypeptide portion that can be 5' to the 
hedgehog polypeptide itself, and that contains at least one element (e.g., an amino acid 
residue) that may be murelated to hedgehog, linked to (b) an N-terminal cysteine 
corresponding to Cys-1 of Sonic hedgehog that is part of a hedgehog antagonist of the 
invention, or a portion of hedgehog antagonist This N-terminal extension moiety (e.g., 
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the &st N-terminal polypq)tide portion) can be a histidine tag, a maltose binding 
protein, glutathione-S-transferase, a DNA binding domain, or a polymerase activating 
domain. The functional antagonist may include an N-terminal extension moiety that 
contains an element which replaces the Cys-1 of mature hedgehog or an N-terminal 
5 cysteine that corresponds to Cys-1 of a mature Sonic hedgehog. 

2. N-teraiinal deletions 

Another variation of a functional antagonist is a hedgehog protein that is missing 
no greater than about 12 amino acids beginning from that N-terminal cysteine 
corresponding to Cys-1 of a mature hedgehog. Deletions in more than the about the first 

10 12 contiguous amino acid residues do not generate functional antagonists. Preferably, 
deletions of about 10 contiguous amino acids will provide suitable functional 
antagonists. One can, however, remove fewer than 10 contiguous residues and still 
maintain antagonist function. Moreover, one can delete various combinations of non- 
contiguous residues provided that there are at least about 3 deleted residues in total. 

15 These structures highlight the importance of the N-temadnus of hedgehog 

proteins for function and indeed, underscore the need to conjugate a hedgehog protein at 
a site other than the N-terminal cj^eine. All of the N-terminal deletion variants were 
indistinguisliable from mature Sonic hedgehog (Shh) in their ability to bind patched-1, 
but were inactive in the ui vitro C3H10T1/2 AP induction assay. All these N-terminal 

20 variants are unable to promote hedgehog-dependent signaling. 

3. N-terminal mutations 

Yet another functional antagonist has a mutation of the N-terminal cysteine to 
another amino acid residue. Any non-hydrophobic amino acid residue may be 
acceptable and persons having ordinary skill in the art following the teachings described 

2s herein wiU be able to perform the mutations and test the effects of such mutations. One 
example is Shh in vsdiich the N-terminal cysteine is replaced >vith a serine residue. This 
mutated form is indistinguishable from mature Shh in its ability to bind patched-1, but it 
blocks AP induction by mature Shh when tested for function in the C3H10T1/2 AP 
induction assay. Replacements with aspartic acid, alanine and histidine have also shown 

30 to serve as antagonists. 
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4. N-temiinal cysteine modificatioiis 

Becaizse the piimaiy amino acid sequence of hedgehog contains the Cys-1 that is 
important for biological activity, certain other modifications will result in inactive 
antagonist variants of hedgehog protein. Another antagonist is an isolated fimctional 

5 antagonist of a hedgehog polypeptide, comprising a hedgehog polypeptide containing an 
N-terminal cysteiQe that corresponds to Cys-1 of a mature Sonic hedgehog, except that 
the cysteine is in a modified form. Antagonist polypeptides of hedgehog may have non- 
sequence modifications that include in vivo or in vitro chemical derivatization of their 
N-terminal cysteine, as well as possible changes in acetylation, methylation, 

10 phosphorylation, amidalion, or carboxylation. As an example, the functional antagonist 
can have an N-terminal cysteine in an oxidized form. Thus, a functional antagonist can 
have an N-terminal cysteine that is effectively modified by including it as part of an N- 
t^minal extension moiety. 

The functional antagonist polypeptides can include amino acid sequences that 

15 are at least 60% homologous to a hedgehog protein. The antagonist must exhibit at least 
the following functional antagonist properties: (i) the isolated protein binds the receptor 
patched-1 with an afi&nity that may be less than, but is preferably at least the same as, 
the binding of mature hedgehog protein to patched-1; and (ii) the isolated protein blocks 
alkaline phosphatase (AP) induction by mature hedgehog protein when tested in an in 

20 vitro CH3 1 OTl/2 cell-based AP induction assay. 

Antagonists useful in the present invention also include those which arise as a 
result of the existence of multiple genes, alternative transcription events, alternative 
RNA splicing events, and alternative translational and posttxanslational events. The 
polypeptide can be made entirely by synthetic means or can be expressed in systems, 

25 e.g., cidtured cells, which result in substantially the same posttranslational modifications 
present when the protein is e?q>ressed in a native cell, or in systems which result in the 
omission of posttranslational modifications present when expressed in a native cell. 

In a preferred embodiment, isolated antagonist is a polypeptide with one or more 
of the following characteristics: 
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(i) it has at least 60, more preferably 90 and most preferably 95% sequence 
identity with amino acids of SEQ ID NOS: 10-18 and 23-26; 

(ii) it either has a modified N-terminal cysteine or lacks an N-terminal cysteine 
or has an N-terminal cysteine in a position different jfirom the N-terminal cysteine 

5 corresponding to Cys-1 of the hedgehog; 

(iii) it blocks alkaline phosphatase induction by mature hedgehog in CH310T1/2 

cells; 

(iv) it binds or interacts with its receptor patched-1 with an affinity that may be 
less than, but is preferably at least the same as, the binding of mature hedgehog 

10 protein to patched-1 ; 

(v) it is unable to induce ptc-1 and gli-1 expression in vitro in CH310T1/2 cells; 

or 

(vi) it is unable to induce AP in CH3 lOTl/2 assays. 

15 B. Antibody Homologs as Antagonists 

In other embodiments, the antagonists used in the method of the invention to 
bind to, including block or coat, cell-surface hedgehog (such as vertebrate Sonic, Indian 
or Desert) and/or cell sxirface ligand for said hedgehog proteins (such as patched) is an 
anti-hedgehog and/or anti patched monoclonal antibody or antibody homolog, as 

20 defined previously. Preferred antibodies and homologs for treatment, in particular for 
human treatment, include human antibody homologs, humanized antibody homologs, 
chimeric antibody homologs. Fab, Fab', F(ab*)2 and F(v) antibody fiagments, and 
monomers or dimers of antibody heavy or light chains or mixtures thereof. Monoclonal 
antibodies against VLA-4 are a preferred binding agent in the method of the invention. 

25 The technology for producing monoclonal antibodies is well known. The 

preferred antibody homologs contemplated herein can be expressed fix>m intact or 
truncated genomic or cDNA or firom synthetic DNAs in prokaryotic or eukaryotic host 
cells. The dimeric proteins can be isolated firom the culture media and/or refolded and 
dimerized in vitro to form biologically active compositions. Heterodimers can be 

30 formed in vitro by combining separate, distinct polypeptide chains. Alternatively, 
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heterodimers can be foimed in a single cell by co-esqpiessing nucleic acids encoding 
separate, distinct polypeptide chains. See, for example, W093/09229, or U.S. Pat No. 
5,41 1,941, for several exemplary recombinant heterodimer protein production protocols. 
Currently preferred host cells include, without limitation, prokaryotes includmg E. coli, 

5 or eukaryotes including yeast, Saccharomyces, insect cells, or mammalian cells, such as 
CHO, COS or BSC cells. One of ordinary skill in the art will appreciate that other host 
cells can be used to advantage. For example, anti-hedgehog antibodies may be identified 
by immunoprecipitation of 1251-labeled cell lysates j&om hedgehog -expressing cells. 
Anti-hedgehog antibodies may also be identified by flow cytometry, e.g., by measuring 

10 fluorescent staining of cells incubated with an antibody believed to recognize hedgehog 
protein. The lymphpcytes used in the production of hybridoma cells typically are 
isolated from immimized maromals whose sera have already tested positive for the 
presence of anti-hedgehog antibodies using such screening assays. 

Typically, the immortal cell line (e.g., a myeloma cell Ime) is derived from the 

15 same mammalian species as the lymphocytes. Preferred immortal cell lines are mouse 
myeloma cell lines that are sensitive to culture medium containing hypoxantbine, 
aminopterin and thymidine ("HAT medium"). Typically, HAT-sensitive mouse 
myeloma cells are fiised to mouse splCTiocytes using 1500 molecular weight 
polyethylene glycol ("PEG 1500"). Hybridoma cells resulting from the fusion are then 

20 selected using HAT mediimi, \;^ch kills unfused and unproductively fused myeloma 
cells (unfixsed splenocytes die after several days because they are not transformed). 
Hybridomas producing a desired antibody are detected by screening the hybridoma 
culture supematants. For example, hybridomas prepared to produce anti-hedgehog or 
patched antibodies may be screened by testing the hybridoma culture supernatant for 

2S secreted antibodies having the ability to bind to a recombinant hedgehog or patched 
expressing cell line. 

To produce antibody homologs that are intact immunoglobulins, hybridoma cells 
that tested positive in such screening assays were cultured in a nutrient medium imder 
conditions and for a time sufBcient to allow the hybridoma cells to secrete the 
30 monoclonal antibodies into the culture medium. Tissue culture techniques and culture 
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media suitable for hybridoma cells are well knowa. The conditioned hybridonoia culture 
supemataat may be collected and the anti-hedgehog or patched aatibodies optionally 
further purified by well-known methods. 

Alternatively, the desired antibody may be produced by injecting the hybridoma 

5 cells into the peritoneal cavity of an unimmimized mo\ise. The hybridoma cells 
proliferate in the peritoneal cavity, secreting the antibody which accumulates as ascites 
fluid. The antibody may be harvested by withdrawii^ the ascites fluid firom the 
peritoneal cavity with a syringe. Several anti-hedgehog or patched monoclonal 
antibodies have been previously described. These anti-hedgehog or patched monoclonal 

10 antibodies and others will be useful in the metiiods of treatment according to the present 
invention. 

Fxilly human monoclonal antibody homologs against hedgehog or patched are 
another preferred binding agent which may block or coat hedgehog ligands in the 
method of the invention. In their intact form these may be prepared using in vitro- 

15 primed hunlan splenocytes, as described by Boemer et al., 1991, J. Immunol., 147, 86- 
95. Alternatively, they may be prepared by repertoire cloning as described by Persson et 
al., 1991, Proc. Nat Acad. Sci. USA, 88: 2432-2436 or by Huang and StoUar, 1991, J. 
Immunol. Methods 141, 227-236. U.S. Patent 5,798,230 (Aug. 25, 1998, "Process for 
the preparation of human monoclonal antibodies and their use") who describe 

20 preparation of himian monoclonal antibodies from human B cells. According to this 
process, human antibody-producing B cells are irmnortalized by infection with an 
Bpstein-Barr virus, or a derivative thereof, ibat e?cpresses Epstein-Bart virus nuclear 
antigen 2 (EBNA2). EBNA2 function, which is reqwred for immortalization, is 
subsequently shut off, which results in an increase in antibody production. 

25 In yet another method for producing fully human antibodies. United States 

Patent 5,789,650 (Aug. 4, 1998, "Transgenic non-himian animals for producing 
heterologous antibodies") describes transgenic non-human animeds capable of producing 
heterologous antibodies and transgenic non-human animals having inactivated 
endogenous immunoglobulin genes. !&idogenous immunoglobulin gen^ are suppressed 

30 by antisense polynucleotides and/or by antiserum directed against endogenous 
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inimunoglobulms. Heterologous antibodies ate encoded by immunoglobulin genes not 
normally found in the genome of Hiat species of non-human animal. One or more 
transgenes containing sequences of unrearranged heterologous human immunoglobulin 
heavy chains are introdiiced into a non-human animal thereby forming a transgenic 

5 animal capable of functionally rearranging transgenic immunoglobiilin sequences and 
producing a repertoire of antibodies of various isotypes encoded by human 
immunoglobulin genes. Such heterologous human antibodies are produced in B-cells 
which are thereafter immortalized, e.g., by fusing with an immortalizing cell line such as 
a myeloma or by manipulating such B-cells by other techniques to perpetuate a cell line 

10 capable of producing a monoclonal heterologous, fidly human antibody homolog. 

Large nonimmunized human phage display libraries may also be used to isolate 
high affinity antibodies that can be developed as human therapeutics usiag standard 
phage technology (Vaughan et al, 1996). 

Yet another preferred binding agent which may block or coat hedgehog ligands 

15 in the method of the invention is a humanized recombinant antibody homolog having 
anti-hedgehog or patched specificity. Following the early methods for the preparation of 
true "chimeric antibodies" (where the entire constant and entire variable regions are 
derived from different sources), a new approach was described in EP 0239400 (Winter 
et al.) whereby antibodies are altered by substitution (within a given variable region) of 

20 their complementarity determining regions (CDRs) for one species with those from 
another. This process may be used, for example, to substitute the CDRs from human 
heavy and light chain Ig variable region domaios with altanative CDRs from murine 
variable region domains. These alt^ed Ig variable regions may subsequently be 
combined with human Ig constant regions to create antibodies which are totally human 

25 in composition except for the substituted murine CDRs. Such CDR-substituted 
antibodies would be predicted to be less likely to elicit an immime response in humans 
compared to true chimeric antibodies because the CDR-substituted antibodies contain 
considerably less non-human componmts. The process for humanizing monoclonal 
antibodies via CDR "grafting" has been termed "reshaping". (Riechmann et al., 1988, 

30 Nature 332, 323-327; Verhoeyen et al., 1988, Science 239, 1534-1536). 
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Typically, complementarity detenxuxdng regions (CDRs) of a murine antibody 
aie transplanted onto the corresponding regions in a human antibody, since it is the 
CDRs (three in antibody heavy chains, three in light chains) that are the regions of the 
mouse antibody which bind to a specific antigen. Transplantation of CDRs is achieved 

5 by genetic engineering whereby CDR DNA sequences are determined by cloning of 
murine heavy and light chain variable (V) region gene segments, and are then 
transferred to corresponding human V regions by site directed mutagenesis. In the final 
stage of the process, human constant region gene segments of the desired isotype 
(usually gamma I for CH and kappa for CL) are added and the hxmianized heavy and 

10 light chain genes are co-e:}q>ressed in manunalian cells to produce soluble himianized 
antibody. 

The transfer of these CDRs to a human antibody confers on this antibody the 
antigen binding properties of the original murine antibody. The six CDRs in the murine 
antibody are mounted structurally on a V region "fi:amework" region. The reason that 

15 CDR-grafting is successful is that framework regions between mouse and human 
antibodies may have very sunilar 3-D structures with similar points of attachment for 
CDRS, such that CDRs can be interchanged. Such humanized antibody homologs may 
be prepared, as exemplified in Jones et al., 1986, Nature 321, 522-525; Riechmann, 
1988, Nature 332, 323-327; Queen et al., 1989, Proc. Nat Acad. Sci, USA 86, 10029; 

20 and Orlandi et al., 1989, Proc. Nat. Acad. Sci. USA 86, 3833. 

Nonetheless, certain amino acids within framework regions are thought to 
interact with CDRs and to influence overall antigen binding afGnity. The direct transfer 
of CDRs &om a murine antibody to produce a recombinant humanized antibody vsdthout 
any modifications of the human V region fiiameworks often results in a partial or 

25 complete loss of binding af&nity. In a number of cases, it appears to be critical to alter 
residues in the fismework regions of Ihe acceptor antibody in order to obtain binding 
activity. 

Queen et al., 1989 (supra) and WO 90/07861 (Protein Design Labs) have 
desoibed the preparation of a humanized antibody that contains modified residues in the 
30 framework regions of the acceptor antibody by combining the CDRs of a murine MAb 
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(and-Tac) with human immunoglobxilin framework and constant regions. They have 
demonstrated one solution to the problem of the loss of binding afOnity that often results 
from direct CDR transfer without any modifications of the human V region framework 
residues; their solution involves two key steps. First, the human V framework regions 

5 are chosen by computer analysts for optimal protein sequence homology to the V region 
framework of the original murine antibody, in this case, the anti-Tac MAb. In the 
second step, the tertiary structure of the murine V region is modelled by computer in 
order to visualize framework amino acid residues which are likely to interact with the 
murine CDRs and these murine amino acid residues are then superimposed on the 

10 homologous human framework. See also U.S. Patents 5,693,762; 5,693,761; 5,585,089; 
and 5,530,101 (Protein Design Labs), 

One may use a different approach (Tempest et al.,1991. Biotechnology 9, 266- 
271) and utilize, as standard, the V region framewoiks derived from NEWM and REI 
heavy and light chains respectively for CDR-grafting without radical introduction of 

IS mouse residues. An advantage of using the Tempest et al., approach to construct 
NEWM and REI based humanized antibodies is that the 3dimensional structures of 
NEWM and REI variable regions are known from x-ray crystallography and thus 
specific interactions between CDRs and V region framework residues can be modeled. 

Regardless of the approach taken, the examples of the initial humanized 

20 antibody homologs prepared to date have shown that it is not a straightforward process. 
However, even acknowledging that such framework changes may be necessary, it is not 
possible to predict, on the basis of the available prior art, which, if any, fr^imework 
residues will need to be altered to obtain frmctional humanized recombinant antibodies 
of the desired specificity. Results thus far indicate that changes necessary to preserve 

25 specificity and^or afBnity are for the most part unique to a given antibody and cazmot be 
predicted based on the humanization of a different antibody. 



C. Small Organic Molecules as Antagonists 

hi other embodiments, a hedgehog antagonist may be a small organic molecule. 
30 Such a small organic molecule may antagonize hedgehog signal transduction via an 
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interaction with but not limited to hedgehogs patched (ptc), gli, and/or smoothened. It is, 
therefore, specifically contemplated that these small moleciiles which intefere with 
aspects of hedgehogs ptc, or smoothened signal transduction activity will likewise be 
capable of inhibiting angiogenesis (or other biological consequences) in normal cells 
and/or mutant cells. Thus, it is contemplated that in certain embodiments, these 
compounds may be useful for inhibiting hedgehog activity in normal cells. In other 
embodiments, these compounds may be usefid for inhibitng hedgehog activity in 
abnormal celis^ In preferred embodiments, the subject inhibitors are organic molecviles 
having a molecular weight less than 2500 amu, more preferably less than ISOO amu, and 
even more preferably less than 750 amu, and are capable of antagonizing hedgehog 
signaling, preferably specifically in target cells. 

For example, compounds useful iu the subject methods include compounds may 
be represented by general forumla (T): 




wherein, as valence and stability permit, 

Rl and R2, independently for each occurrence, represent H, lower alkyl, aryl 
(e.g., substituted or unsubstituted), aralkyl (e.g., substituted or unsubstituted, e.g., - 
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(CH2)naiyl), or heteroaiyl (e.g., substituted or unsubstituted), or heteroaralkyl (e.g,, 

substituted or unsubstituted, e.g., -(CH2)nbeteroaralkyl-); 

« 

L, independently for each occuirence, is absent or represents -(CH2)n-all!!yl, - 
alkenyK -alkynyl-, -(CH2)nalkenyK -(CH2)iialkynyl-, -(CH2)nO(CH2)p-, - 
(CH2)nNR2(CH2)p-. -(CH2)nS(CH2)p-, -(CH2)nalkenyl(CH2)p-> 

(CH2)nalkynyl(CH2)p-, -0(CH2)n-, -NR2(CH2)n-, or -S(CH2)nS 

Xi and X2 can be selected, independently, from -NCRg)-, -0-, -S-, -Se-, -N=N-, - 
ON=CH-, -(R8)N-N(R8)-, -ON(Rg)-, a heterocycle, or a direct bond between L and Yi 
or Y2, respectively; 

Yl and Y2 can be selected, independently, from -C(=0)-, -C(=S)-, -8(02)-, - 
S(0)-, -C(=NCN)-, -P(=0)(0R2)", a heteroaromatic group, or a direct bond between Xi 
and Zx or X2 and 7/i^ respectively; 

Zi and Z2 can be selected, independently, from -NCRg)-, -0-, -S-, -Se-, -N=N-, - 
ON=CH-, -RgN-NRg-, -ONR3-, a heterocycle, or a direct bond between Yi or Y2, 
respectively, and L; 

Rg, independently for each occurrence, represents H, lower alkyl, -(CH2)naryl 
(e.g., substituted or unsubstituted), -(CH2)nbeteroaryl (e.g., substituted or unsubstituted), 
or two Rg taken together may form a 4- to 8-membered ring, e.g., with Xj and Z^ or X2 
and Zj , which ring may include one or more carbonyls; 

p represents, independently for each occurrence, an integer from 0 to 10, 
preferably from 0 to 3; and 

n, individually for each occurence, represents an integer from 0 to 10, preferably 
from 0 to 5. 

In certain embodiments, R^ represents a substituted or xmsubstituted heteroaryl 

group. 

In certain embodiments, X^ and X2 can be selected from -N(R8)-, -0-, -S-, a 
direct bond, and a heterocycle, Y^ and Y2 can be selected from -C(=0)-, *C(=S)-, and - 
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S(02)-, and Zi or Z2 can be selected from -NORg)-, -0-, -S-, a direct bond, and a 
heterocycle. 

In certain related embodiments, Xj-Yj-Zj or X2-Y2-Z2 taken together represents 

a urea (N-C(O)-N) or an amide (N-C(0) or C(O)-N). 

In certain embodiments, or X2 represents a diazacarbocycle, such as a 

piperazine. 

In certain embodiments, represents a fused cycloalkyl-aryl or cycloalkyl- 
heteroaiyl system, for example: 




vdierein W is a substituted or unsubstituted aiyl or heteroaiyl ring fused to the 
cycloalkyl ring and m is an integer from 1-4 inclusive, e.g., from 1-3, or fix)m 1-2. The 
fused system may be bound to L from any carbon of the fused system, including the 
position depicted above. In certain embodiments, may represent a tetrahydronaphthyl 
group, and preferably Yi-Xj-L-Ri taken together represent a tetrahydronaphthyl amide 
group, such as: 




In embodiments wherein Y^ and Z^ are absent and X^ comprises a pyrimidone, 
compounds useful in the present invention may be represented by general formula (E): 
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O 




L 



i© 

N 



X 





Formula n 



wherein, as valence and stability permit, 

Rj and R2, independently for each occurrence, represent H, lower alkyl, - 

(CH2)iiaiyl (e.g., substituted or unsubstituted), or -(CH2)iiheteroaiyl (e.g., substituted or 

unsubstituted); 

L, independently for each occurrence, is absent or represents -(CH2)ii-alkyl, - 
alkenyK -alkynyl-, -(CH2)nalkenyl-, -(CH2)nalkynyl-, -(CH2)nO(CH2)p-, - 
(CH2)nNR2(CH2)p-, -(CH2)nS(CH2)p-, -(CH2)nalkenyl(CH2)p-, 

(CH2)nalkynyl(CH2)p-» -0(CH2)n-, -NR2(CH2)n-, or -S(CH2)nS 

X can be selected from -NCRg)-, -0-, -Se-, -N=N-, -ON=CH-, -(R8)N-N(R8)- 

, -ONCRg)-, a heterocycle, or a direct bond between L and Y; 

Y can be selected from -C(=0>, -C(=S)-, -8(02)-, -S(0)-, -C(=NCN)-, - 
P(=0)(0R2)-, a heteroaromatic groi^, or a direct bond between X and Z; 

Z can be selected from -NCRg)-, -0-, -S-, -Se-, -N=N-, -ON=CH-, -RgN-NRg-, - 
ONR3-, a heterocycle, or a direct bond between Y and L; 

Rg, independently for each occurrence, represents H, lower alkyl, -(CH2)naryl 
(e.g., substituted or unsubstituted), -(CH2)nheteroaryl (e.g., substituted or unsubstituted), 
or two Rg taken together may form a 4- to 8-membered ring, e.g., with X and Z, which 
ring may include one or more carbonyls; 
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W represents a substituted or unsiibsituted aryl or heteroaryl ring fused to the 
pyrimidone ring; 

p represents, independently for each occurrence, an integer from 0 to 10, 
preferably from 0 to 3; and 

n, individually for each occurence, represents an integer from 0 to 10, preferably 
from 0 to 5. 

In embodiments wherein Yj and are absent and Xj comprises a pyrimidone, 

compounds useful in the present invention may be represented by general formula (TO): 

o 




R2 

Formula HI 



wherein, as valence and stability permit, 

Rl and R2, independently for each occurrence, represent H, lower alkyl, aryl 

(e.g., substituted or unsubstituted), aralkyl (e.g., substituted or imsubstituted, e.g., - 
(CH2)naryl), or heteroaryl (e.g,, substituted or unsubstituted), or heteroaralkyl (e.g., 
substituted or unsubstituted, e.g., -(CH2)nheteroaralkyl-); 

L, independently for each occurrence, is absent or represents -(CH2)n-ali^l» - 
alkenyl-, -alkynyl-, ^CH2)aalkenyl-, -(CH2)nalkynyl-, -(CH2)nO(CH2)p-, - 
(CH2)nNR2(CH2)p-, -(CH2)nS(CH2)p-, -(CH2)nalkenyl(CH2)p-, 

(CH2)nalkynyl(CH2)p-, -0(CH2)n-, -NR2(CH2)n-, or -S(CH2)n-, which may optionally 
be substitued with a groixp selected from H, substituted or unsubstituted lower alkyl. 



-50- 



wo 01/98344 



PCT/USOl/19435 



alkenyl, or alkynyl, cycloalkylalkyl (e,g,, substituted or unsubstituted, e.g., - 
(CH2)nCycloalkyl), (e.g., substituted or unsubstituted), aiyl (e.g., substituted or 
unsubstituted), aralkyl (e.g., substituted or unsubstituted, e.g., -(CH2)naryl). or 
heteroaryl (e.g., substituted or unsubstituted), or heteroaralkyl (e.g., substituted or 
5 unsubstituted, e.g., -(CH2)nheteroaralkyl-), preferably from H, lower alkyl, -iCH^j^axyl 
(e.g., substituted or unsubstituted), or -(CH2)nheteroaryl (e.g., substituted or 
unsubstituted); 

X can be selected from -N(R8)-, -0-, -S-, -Se-, -N=N-, -ON=CH-, -(R8)N.N(R8)- 
, -ONORg)-, a heterocycle, or a direct bond between L and Y; 
10 Y can be selected fiom -C(-OK -C(=S)-, -S(02)-, -S(0)-, -C(=NCN)-, - 

P(=0)(0R2)-, a heteroaromatic group, or a direct bond between X and Z; 

Z can be selected from -N(R8)-, -0-, -S-, -Se-, -N=N-, -ON=CH-, -RgN-NRg-, - 
ONRg-, a heterocycle, or a direct bond between Y and L; 

Rg, independently for each occurrence, represents H, lower alkyl, aiyl (e.g., 
15 substituted or unsubstituted), aralkyl (e.g., substituted or unsubstituted, e.g., - 
(CH2)naryl)» or heteroaryl (e.g., substituted or unsubstituted), or heteroaralkyl (e.g., 
substituted or unsubstituted, e.g., -(CH2)nheteroaralkyl-), or two Rg taken together may 
form a 4- to 8-membered ring, e.g., with X and Z, which ring may include one or more 
carbonyls; 

20 W represents a substituted or unsubsituted aryl or heteroaryl ring fused to the 

pyrimidone ring; 

p represents, independently for each occurrence, an integer from 0 to 10, 
preferably from 0 to 3; and 

n, individually for each occurrence, represents an integer from 0 to 10, 
25 preferably from 0 to 5. 

In certain embodiments, R^ represents a substituted or unsubstituted aryl or 
heteroaryl group, e.g., a phenyl ring, a pyridine ring, etc. In certain embodiments 
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wherein -LRi represents a substituted aiyl or heteroaryl group, Ri is preferably not 
substituted with an isopropoxy (Me2CHO-) group. In certain embodiments wherein - 
LRi represents a substituted aryl or heteroaryl group, Ri is preferably not substituted 
with an ether group. In certain embodiments, substituents on Ri (e.g., other than 
hydrogen) are selected from halogen, cyano, alkyl, alkenyl, alkynyl, aryl, hydroxyl, 
(unbranched alkyl-O-), silyloxy, amino, nitro, thiol, amino, imino, amido, phosphoryl, 
phosphonate, phosphine, carbonyl, carboxyl, carboxamide, anhydride, silyl, thioether, 
alkylsulfonyl, arylsulfonyl, sulfoxide, selenoether, ketone, aldehyde, ester, or -(CH2)m- 

Rg. In certain embodiments, non-hydrogen substituents are selected from halogen, 

cyano, aUcyl, alkenyl, alkynyl, aryl, nitro, thiol, imino, amido, carbonyl, carboxyl, 
anhydride, thioether, alkylsulfonyl, arylsulfonyl, ketone, aldehyde, and ester. In certain 
embodiments, non-hydrogen substituents are selected from halogen, cyaxxo^ alkyl, 
alkenyl, alkynyl, nitro, amido, carboxyl, anhydride, alkylsulfonyl, ketone, aldehyde, and 
ester. 

In certain embodiments, X can be selected from -NCRg)-, -0-, -S-, a direct bond, 
and a heterocycle, Y can be selected from -C(=0)-, -C(=S)-, and -S(02)-, and Z can be 
selected from -NCRg)-, -0-, -S-, a direct bond, and a heterocycle. In certain such 

embodiments, at least one of Z and X is present. 

In certain related embodiments, X-Y-Z taken together represents a urea 
(NC(0)N) or an amide CNC(0) or C(O)N). 

In certain embodiments, W is a substituted or unsubstituted benzene ring. 

In certain embodiments, X represents a diazacarbocycle, such as a piperazine, 
e.g., substituted or unsubstituted. 

In certain embodiments, X can be selected from -N(Rg)-, -0-, -S-, and a direct 

bond, Y can be selected from -C(=0)-, -C(=S)-, and -S(02)-, and Z can be selected from 

-N(Rg)-, -0-, -S-, and a direct bond, such that at least one of X and Z is present 

In certain embodiments R« represents H, lower alkyl, aralkyl, heteroaralkyl, aryl, 
or heteroaryl, e.g., H or lower alkyl. 

In certain embodiments, X rqnresents -NH-. 
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In certain embodiments, -L-X- represents -(unbranched lower alkyi)-NH-, e.g., - 
CH2-NH-, -CH2CH2-NH-, etc. 

In certain other embodiments, compounds useful in the subject methods include 
compounds may be represented by general forumla (IV): 




X LR2 

Formula IV 
wherein, as valence and stability permit, 

Rl and R2, independently for each occurrence, represent H, substituted or 

unsubstituted lower alkyl, alkenyl, or alkynyl, -(CH2)nCycloalkyl (e.g., substituted or 

unsubstituted), -(CH2)iiaryl (e.g., substituted or unsubstituted), or -(CH2)nheterocyclyl 

(e.g., substituted or unsubstituted); 

L, independently for each occurrence, is absent or represents -(CH2)n-alkyl, - 
alkenyl-, -alkynyl-, -(CH2)nalkenyl-, -(CH2)nalkynyl-, -(CH2)nO(CH2)p-, - 
(CH2)nNR2(CH2)p-, -(CH2)nS(CH2)p-, -(CH2)nalkenyl(CH2)p-, 

(CH2)nalkynyl(CH2)p-, -0(CH2)n-> -NR2(CH2)n-, or -S(CH2)nS 

V represents N or CH; 

W, independently for each occurrence, represents N or CH, such that preferably 
no more than one occurrence of W represents N; 

X and Z, independentiy, can be selected from -CH-, -N(Rg)-, -0-, -S-, or -Se-; 

Y can be selected from -C(=0)-, -C(=S)-, -S(02>, -S(0)-, -C(=NC3M)-, or - 
P(=0)(0R2)-; 

Rg, independently for each occurrence, represents H, substituted or xmsubstituted 
lower alkyl, -(CH2)nCycloalkyl (e.g., substituted or unsubstituted), -(CH2)iiaiyl (e.g., 
substituted or unsubstituted), -(CH2)nheterocyclyl (e.g., substituted or vmsubstituted), or 
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two Rg takea together may form a 4- to 8-membered ring, e.g., with and Zx or X2 
and Zj, which ring may include one or more carbonyis; 

R3 and R4, independently represent from 1-4 substituents on the ring to which 
they are attached, selected from, independently for each occurrence, hydrogen, 
halogens, alkyls, alkenyls, alkynyls, aryls, hydroxyl, =0, =S, alkoxyl, silyloxy, amino, 
nitro, thiol, amines, imines, amides, phosphoryls, phosphonates, phosphines, carbonyis, 
carboxyls, carboxamides, anhydrides, silyls, ethers, thioeHiers, alkylsulfonyls, 
arylsulfonyls, selenoethers, ketones, aldehydes, esters, or -(CH2)in-R8; 

m represents an integer from 0-3; 

p represents, independently for each occurrence, an integer from 0 to 10, 
preferably from 0 to 3; and 

n, individually for each occurence, represents an integer from 0 to 10, preferably 
from 0 to 5. 

In certain embodiments, Ri and R2 are independently selected from substituted 
or wisubstituted aryl, heterocyclyl, branched or unbranched alkyl, or cycloalkyl. In 
embodiments wherein Ri or R2 is aryl or heterocyclyl, substituents are preferably 
selected from H, alkyl, acyl, carboxy, ester, amide, cyano, ether, thioether, amino, 
halogen, nitro, and trihalomethyl. 

In c^lain embodiments, R3 is absent or represents one or two substituents 
selected from alkyl, acyl, carboxy, ester, amide, cyano, ether, thioether, amino, acyl, 
halogen, nitro, and trihalomethyl. 

In certain embodiments, R4 is absent or represents one or two substituents 
selected from ether, amino, thioether, alkyl, aryl, (=0), or carbonyl (e.g., carboxy, ester, 
ketone, aldehyde, etc.). 

In obtain embodiments, L is absent for each occurrence, or represents -CH2- or - 
CH2CH2-. 

In certain embodiments, X represents NR«. R% preferably represents H. 
In certain mibodiments^ Z represents NR«. R« preferably represents H. 
In cCTtain embodiments, Y represents -C(=0)-, -C(=S)-, or -8(02)-- 
hi certain embodiments, m is 1. 
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In certain embodiments, W represents CH in all occurrences. 
In certain embodiments, V represents N. 

In certain embodiments, compounds useful in the present invention may be 
5 represented by general formula (V): 



/ 

'Z 




'Z 



Formula V 
wherein, as valence and stability permit, 

Rl and R2, independently for each occurrCTice, represent H, substituted or 

10 unsubstituted low«r alkyl, alkenyl, or alkynyl, -(CH2)nCycloalkyl (e.g., substituted or 
unsubstituted), -(CH2)naiyl (e.g., substituted or unsubstituted), or -(CH2)nheterocyclyl 
(e.g., substituted or xmsubstituted); 

L, independently for each occurrence, is absent or represents -(CH2)n-alkyl, - 
alkenyl-, -alkynyl-, -(CH2)nalkenyl-, .(CH2)nalkynyl-, -(CH2)nO(CH2)p-, - 
15 (CH2)nNR2(CH2)p-, -(CH2)nS(CH2)p-, -(CH2)iialkenyl(CH2)p-, 

(CH2)nalkynyl(CH2)p-, -0(CH2)n-3 -NR2(CH2)n-, or -S(CH2)nS 

X and Z, independently, can be selected from -CH-, -N(R8>, -0-, -S-, or -Se-; 

Y can be selected from -C(=0)-, -C(=S)-, -8(02)-, -S(0)-, -C(=NCN)-, or - 
P(=0)(0R2)-; 

20 Rg, independently for each occiurence, represents H, substituted or unsubstituted 

lower alkyl, -(CH2)ncycloalkyl (e.g., substituted or imsubstituted), -(CH2)naryl (e.g., 
substituted or unsubstituted), -(CH2)i,heterocyclyl (e.g., substituted or unsubstituted), or 
two Rg taken together may form a 4- to 8-membered ring, e.g., with X| and or X2 
and Z^, which ring may include one or more carbonyls; 
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Ra and K4, independently represent from 1-4 substituents on the ring to which 
they are attached, selected from, independently for each occurrence, hydrogen, 
halogens, alkyls, alkenyls, alkynyls, aryls, hydroxyl, =0, =S, alkoxyl, silyloxy, amino^ 
nitro, thiol, aioines, imines, amides, phosphoryls, phosphonates» phosphines, carbonyls, 
5 carboxyls, carboxamides, anhydrides, silyls, ethers, fhioethers, alkylsulfonyls, 
arylsnlfonyls, selenoethers, ketones, aldehydes, esters, or -(CH2)m-R«; 

p represents, independently for each occurrence, an integer from 0 to 10, 
preferably from 0 to 3; and 

n, individually for each occurence, represents an integer from 0 to 10, preferably 
10 from 0 to 5. 

hi certain embodiments, Ri and R2 are independently selected from substituted 
or unsubstituted aiyl, heterocyclyl, branched or unbranched alkyl, or cycloalkyl. In 
embodiments wherein Ri or R2 is aryl or heterocyclyl, substituents are preferably 
selected from H, alkyl, acyl, carboxy, ester, amide, cyano, ether, thioether, amino, 
15 halogen, nitro, and trihalomethyl. 

hi certsan embodiments, R3 is absent or represents one or two substituents 
selected from alkyl, acyl, carboxy, ester, amide, cyano, ether, thioether, amino, acyl, 
halogen, nitro, and trihalomethyl. 

hi certain embodiments, R4 is absent or represents one or two substituents 
20 selected from ether, amino, thioether, alkyl, aryl, (=0), or carbonyl (e.g., carboxy, ester, 
ketone, aldehyde, etc.). 

In certain embodiments, L is absent for each occurrence, or represents -CH2- or - 
CH2CH2-. 

In certain embodiments, X represents NRs. R« preferably represents H. 
23 In certain embodiments, Z r^resents NRg. Rg preferably represents H. 

In certain embodiments, Y represents -C(=70>, -C(==S)-, or -8(02)-. 

In still other embodiments, compounds which may be usefril in the subject 
methods include compounds may be represented by general formula (VT): 
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Formula VI 
wherein, as valence and stability permit, 

Rl^ R2, R3, and R4, independently for each occxirrence, represent H, lower alkyl, 

-(CH2)naiyl (e.g., substituted or unsubstituted), or -(CH2)nheteroaiyl (e.g., substituted or 

unsubstituted); 

L, independently for each occurrence, is absent or represents -(CH2)n-» -alkenyl-, 
-alkynyl-, -(CH2)nalkenyl-, -(CH2)nalkynyl-, -(CH2)nO(CH2)p-, -(CH2)nNR8(CH2V, - 
(CH2)nS(CH2)p-, -(CH2)nalkenyl(CH2)p-> -(CH2)nalkynyl(CH2)p-. -0(CH2)n-. - 
NR8(CH2)n-, or -S(CH2)nS 

X and D, indqpendently, can be selected from -N(R8)-, -0-, -S-, -(R8)N-N(R8)-, 

-ONCRg)-, or a direct bond; 

Y and Z, independently, can be selected from O or S; 

E represents O, S, or NR5, \siierein R5 represents LRs or -(C=0)LR8. 

Kg, independently for each occurrence, represents H, lower alkyl, -(CH2)naryl 

(e.g., substituted or xmsubstituted), -(CH2)iiheteroaryl (e.g., substituted or unsubstituted), 
or two Rg taken togedier may form a 4- to 8-memb^ed ring; 

p represents, independ^tly for each occurrence, an integer from 0 to 10, 
preferably from 0 to 3; 

n, individually for each occurrence, represents an integer from 0 to 10, 
preferably from 0 to 5; and 

q and r represent, independently for each occurrence, an integer from 0-2. 
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In certain embodiments, D does not represent N-lower alkyl. In certain 
embodiments, D represents an aralkyl- or heteroaralkyl-substituted amine. 

In certain embodiments, Ri represents a lower alkyl group, such as a branched 
alkyl, a cycloalkyl, or a cycloalkylalkyl, for example, cyclopropyl, cyclopropylmethyl, 
5 neopentyl, cyclobutyl, isobutyl, isopropyl, sec-butyl, cyclobutyknethyl, etc. 

In certain embodiments, Y and Z are O. 

In certain embodiments, the sum of q and r is less than 4, e.g., is 2 or 3. 
In certain embodiments, XLR4, taken together, include a cyclic amine, such as a 
pipera2dne, a morpholine, a piperidine, a pyirolidine, etc. 
10 In certain embodiments, at least one of Ri, R2, and R3 includes an aiyl or 

heteroaryl group. In certain related embodiments, at least two of Ri, R2, and R3 include 
an aryl or heteroaryl group. In certain embodiments, Ri is lower alkyl. 

In certain embodiments, L attached to Ri represents O, S, or NR«, such as NH. 
In certain embodiments, E is NRg. In certain embodiments, E repres^ts an 
15 aralkyl- or heteroaralkyl-substituted amine, e.g., including polycyclic Rg. 

In certain embodiments, X is not NH. In certain embodiments, X is included in a 
ring, or, taken together with -C(==Y)-, represents a tertiary amide. 

In certain embodiments, compounds useful in the present invention may be 
20 represented by general formula (VJI): 




Formula Vn 



wherein, as valence and stability permit. 
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Rj^ R2, R39 R4, Rg* L, X, Y, Z, n, p, q, and r aie as defined above; 

M is absent or represents L, -SO2L-, or -(C=0)L-; and 

s represents, independently for each occurrence, an integer from 0-2. 

In certain embodiments, Y and Z are O. 

In certain embodiments, Ri represents a lower alkyl group, such as a branched 
alkyl, a cycloalkyl, or a cycloalkylalkyi, for example, cyclopropyl, cyclopropylmethyl, 
neopentyl, cyclobutyl, isobutyl, isopropyl, sec-butyl, cyclobutylmethyl, etc. 

In certain embodiments, the sum of q, r, and s is less than 5, e.g., is 2, 3, or 4. 

In certain embodiments, XLR4, taken together, include a cyclic amine, such as a 
piperazdne, a moipholine, a piperidine, a pyrrolidine, etc. 

In certain embodiments, L attached to Ri represents O, S, or NR«, such as NH. 

In certain embodiments, at least one of Ri, R2, and R3 includes an aryl or 
heteroaryl group. In certain related embodiments, at least two of Ri, R2, and R3 include 
an aryl or heteroaryl group. 

In certain embodiments, M is absent. 

In certain embodiments, X is not NH, In certain embodiments, X is included in a 
ring, or, taken together witii -C(=Y)-, represents a tertiary amide. 

In certain embodiments, compounds useful in the present invention may be 
represented by general formula (VIII): 




FommlaVm 



wherein, as valence and stability permit. 
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Rl^ R2, R3, R4, Rgj L, M, X, Y, Z, n, p, q, and r are as defined above. 
In certain embodiments, Y and Z are O. 

In certain embodiments, Ri represents a lower alkyl group, preferably a branched 
alkyl, a cycloalkyl, or a cycloalkylalkyl, for example, cyclopropyl, cyclopropylmethyl, 
neopentyl, cyclobulyl, isobiityl, isopropyl, sec-butyl, cyclobutylmethyl, etc. 

In certain embodiments, the sum of q and r is less than 4, e.g., is 2 or 3. 

In certain embodiments, XLR4, taken together, include a cyclic amine, such as a 
piperazine, a moipholine, a piperidine, a pyrrolidine, etc. 

In certain embodiments, at least one of Ri, R2, and R3 includes an aryl or 
heteroaryl group. In c^lain related embodiments, at least two of Ri, R2, and R3 include 
an aiyl or heteroaryl group. In certain embodiments, Ri is lower alkyl. 

In certain embodiments, L attached to Ri represents O, S, or NRs, such as NH. 

In certain embodiments, M is absent. 

In certain embodiments, X is not NH. In certain embodiments, X is included in a 
ring, or, taken together with -C(=Y)-, represents a tertiary amide. 

In certain embodiments, compoimds useful in the present invention may be 
represented by general formula QX): 



R4LX 




Formula DC 



herein, as valence and stabiUty permit, 

Rl^ R2, R3, R4, Rg* L, M, X, n, and p are as defined above. 
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In certain embodiments, XLR4, taken together, include a cyclic amine, such as a 
piperazine, a morpholine, a piperidine, a pyrrolidine, etc. 

In certain embodiments, Ri represents a lower alkyl group, preferably a branched 
alkyl, a cycloalkyl, or a cycloalkylalkyl, for example, cyclopropyl, cyclopropylmethyl, 
neopentyl, cyclobutyl, isobutyl, isopropyl, sec-butyl, cyclobutylmethyl, etc. 

In certain embodiments, at least one of Ri, R2, and R3 includes an aryl or 
heteroaiyl group. In certain related embodiments, at least two of Ri, R2, and R3 include 
an aryl or heteroaryl group. In certain embodiments, Ri is lower alkyl. 

In certain embodiments, L attached to Ri represents O, S, or NRg, such as NH. 

In certain embodiments, M is absent. 

In certain embodiments, X is not NH. In certain embodiments, X is included in a 
ring, or, taken together with -C(=Y)-, represents a tertiary amide. 

In certain embodiments L represents a direct bond for all occurrences. 

In certain embodiments, compounds useful in the present invention may be 
represented by general formula (X): 



Formula X 
wherein, as valence and stability permit, 

Y, n, p, q, and r are as defined above; 

Z' represents -C(=0>, -C(=S>, -C(=NH)-, SO2, or SO, preferably -C(=0)-, - 




C(=S>; 



V is absent or represents O, S, or NR«; 

G is absent or represents -C(=0)- or -SO2-; 
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J, independently for each occurrence, represents H or substituted or 
unsubstituted lower alkyi or alkylene, such as methyl, ethyl, methylene, ethylene, etc., 
attached to NC(=Y), such that both occurrences of N adjacent to J are linked through at 
least one occurrence of J, and 
5 R9, independently for each occurrence, is absent or represents H or lower alkyl, 

or two occurrences of J or one occurrence of J taken together with one occurrence of R9, 
forms a ring of from 5 to 7 members, which ring includes one or both occurrences of N; 

R5 represents substituted or unsubstituted alkyl (e.g., branched or unbranched), 
alkenyl (e.g., branched or unbranched), alkynyl (e.g., branched or unbranched), 
10 cycloalkyl, or cycloalkylalkyl; 

represents substituted or unsubstituted aryl, aralkyl, heteroaryl, heteroaralkyl, 
heterocyclyl, heterocyclylalkyl, cycloalkyl, or cycloalkylalkyl, including polycyclic 
groups; and 

R7 represents substituted or unsubstituted azyl, aralkyl, heteroaryl, or 
15 heteroaralkyl. 

Li certain embodiments, Y is O. In certain embodiments, Z' represents SO2, - 
C(=0)-, or -C(=S)-. 

In certain embodiments, the sum of q and r is less than 4. 
20 In certain embodiments, NJ2N, taken together, represent a cyclic diamine, such 

as a piperazine, etc., which may be substituted or unsubstituted, e.g., with one or more 
substitutents such as 0x0, lower all^l, lower allQ^l ether, etc. In certain olher 
embodiments, NJ2 or NIR9 taken together represent a substituted or unsubstituted 
heterocyclic ring to which the other occurrence of N is attached. In certain 
25 embodiments, one or both occurrences of J are substituted with one or more of lower 
alkyl, lower alkyl ether, lower alkyl thioether, amido, 0x0, etc. In certain embodiments, 
a heterocyclic ring which comprises an occurrence of J has jfrom 5 to 8 members. 

In certain embodiments, Rs represents a branched alkyl, cycloalkyl, or 
cycloalkylalkyl. 
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In certain embodiments, R6 includes at least one heterocyclic ring, such as a 
thiophene, fliran, oxazole, benzodioxane, benzodioxole, pyrrole, indole, etc. 

In certain embodiments, R7 represents a phenyl alkyl, such as a benzyl group, 
optionally substituted with halogen, hydroxyl, lower alkyl, nitro, cyano, lower alkyl 
5 ether (e.g., optionally substituted, such as CHF2CF2O), or lower alkyl thioether (e.g., 
optionally substituted, such as CF3S). 

In certain embodiments, Rg, when it occurs in V, represents H or lower alkyl, 
preferably H. 

10 In certain embodiments, compounds useful in the present invention may be 

represented by general formula (XI): 



(R9)NJ2N 




Rr VR5 
Formula XI 
wherein, as valence and stability permit, 
15 R5, R6, R7, Rg. R9, Rio, G, J, V, Y, Z', n, and p are as defined above. 

In certain embodiments, Y is O. In certain embodiments, represents SO2, - 
C(=OHor-C(=S)-. 

In certain embodiments, NJ2N, taken together, represent a heterocyclic ring, such 
20 as a piperazine, etc., which may be substituted or imsubstituted, e.g., with one or more 
substitutents such as 0x0, lower alkyl, lower alkyl ether, etc. Li certain other 
embodiments, NJ2 or NJR9 taken together represent a substituted or unsubstituted 
heterocycUc ring to which the other occurrence of N is attached. In certain 
embodiments, one or both occurrences of J are substituted with one or more of lower 
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alkyl, lower alkyl ether, low^ alkyl thioether, amido, oxo, etc. In certain embodiments, 
a heterocyclic ring which comprises an occurrence of J has from 5 to 8 members. 

In certain embodiments, R5 represents a branched alkyl, cycloalkyl, or 
cycloaUcylalkyl. 

5 In certain embodiments, includes at least one heterocyclic ring, such as a 

tbiophene, furan, oxazole, benzodioxane, benzodioxole^ pyrrole, indole, etc. 

In certain embodiments, R? represents a phenyl alkyl, such as a benzyl group, 
optionally substituted with halogen, hydroxyl, lower alkyl, nitro, cyano, lower alkyl 
ether (e.g., optionally substituted, such as CHF2CF2O), or lower alkyl thioether (e.g., 
10 optionally substituted, such as CF3S). 

In certain embodiments, Rg, when it occurs in V, represents H or lower alkyl, 
prefembly H. 

In certain preferred embodiments, the subject inhibitors inhibit hedgehog- 
mediated signal transduction with an IC50 of 1 mM or less, more preferably of 1 |jM or 

IS less, and even more preferably of 1 nM or less. 

Moreover, the subject methods can be performed on cells which are provided in 
culture {in vitro), or on cells in a whole animal (in vivo). See, for example, PCT 
publications WO 95/18856 and WO 96/17924 (the specifications of which are expressly 
hicorporated by reference herein). 

20 

V. Agonists of Hedgehog Biological Activity 

Preferred hedgehog therapeutics useful in methods of the invention are agonists 
that are derived fix>m several sources of hedgehog protein. In one embodiment, the 
agonist is not N-terminally clipped (as described above). Other embodiments of a 

25 hedgehog therapeutic suitable for the present methods are based, in part, on the 
discovery disclosed in U.S. Patetit Application No. 60/067,423 (12/3/97:PCT 
Publication that human Sonic hedgehog, e>q}ressed as a fuU-lengdi construct in either 
insect or in mammalian cells, has a hydrophobic palmitoyl group appended to the alpha- 
amine of the N-terminal cysteine. This is the first example of an extracellular signaling 

30 protein being modified in such a manna:, and, in contrast to thiol-linked palmitic acid 
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modifications whose attachment is readily reversible, this novel N-linked pahnitoyl 
moiety is likely to be very stable by analogy with myristic acid modifications. 

The agonists have at least one of the following properties: (i) the isolated protein 
binds the receptor patched-1 with an affinity that is at similar to, but is preferably higher 
than, the binding of mature hedgehog protein to patched-1; or (ii) the isolated protein 
binds to a hedgehog protein in such a way as to increase the proteins binding affinity to 
patched-1 when tested in an in vitro CH310T1/2 cell-based AP induction assay. 
Agonists of the invention may also have the additional properties of being (iii) able to 
solely induce ptc-1 and gli-1 expression. 

The preferred agonists for use in conjugation with a non-hedgehog conjugate 
(e.g., immunoglobulin or j&agment thereof) include a derivitized hedgehog polypeptide 
sequence as well as other N-terminal and/or C-terminal amino acid sequence or it may 
include all or a fragment of a hedgehog amino acid sequence. Agonist polypeptides of 
the invention include those that arise as a result of the existence of multiple genes, 
altemative transcription events, alternative KNA splicing events, and alternative 
translational and posttranslational events. The polypeptide can be made entirely by 
synthetic means or can be expressed in systems, e.g., cultured cells, which result in 
substantially the same posttranslational modifications present when the protein is 
expressed in a native cell, or in systems which result in the omission of posttranslational 
modifications present when expressed in a native cell. 

In one embodiment, the agonist is a hedgehog polypeptide with one or more of 
the following characteristics: 

(i) it has at least 30, 40, 42, 50, 60, 70, 80, 90 or 95% sequence identity with a 
hedgehog sequence such as SEQ ID NOS: 10-18 or 23-26; 

(ii) it has a cysteine or a fimctional equivalent as the N-terminal end; 

(iii) it may induce alkaline phosphatase activity in C3H10T1/2 cells; 

(iv) it has an overall sequence identity of at least 50%, preferably at least 60%, 
more preferably at least 70, 80, 90, or 95%, with a polypeptide of a hedgehog 
sequence; 

(v) it can be isolated fiom natural sources such as mammalian cells; 
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(vi) it can bind or interact with patched; and 

(vii) it may be hydrophobically-modified (i.e., it has at least one 
hydrophobicmoiety attached to the polypeptide). 



5 Increasing the overall hydrophobic nature of a hedgehog protein increases the 

biological activity of the protein. The potency of a signaling protein such as hedgehog 
can be inoreased by: (a) chemically modifying, such as by adding a hydrophobic moiely 
to, the sulfhydryl and/or to the alpha-amine of the N-terminal cysteine (see 
U.S.60/067,423); (b) replacing the N-temiinal cysteine with a hydrophobic amino acid 

10 (see U.S. 60/067,423); or (c) replacing the N-terminal cysteine with a different amino 
acid and then chemically modifying the substituted residue so as to add a hydrophobic 
moiety at the site of the substitution. 

Additionally, modification of a hedgehog protein at an internal residue on the 
surface of the protein with a hydrophobic moiety by: (a) replacing the intemal residue 

15 with a hydrophobic amino acid; or (b) replacing the intemal residue with a different 
amino acid and then chemically modifying the substituted residue so as to add a 
hydrophobic moiety at the site of the substitution will retain or enhance tiie biological 
activity of the protein. 

Additionally, modification of a piotein such as a hedgehog protein at the C- 

20 temiinus with a hydrophobic moiety by: (a) replacing the C-terminal residue with a 
hydrophobic amino acid; or (b) replacing the C-tenninal residue with a different amino 
acid and then chemically modifying the substituted residue so as to add a hydrophobic 
moiefy at the site of the substitution, will retain or enhance the biological activity of the 
protein. 

25 For hydrophobically-modified hedgehog obtained by chemically modifying the 

soluble, unmodified protein, palmitic acid and other lipids can be added to soluble Shh 
to create a lipid-modified forms with increased potency in the C3HIOT1/2 assay. 
Another form of protein encompassed by the invention is a protein derivatized with a 
variety of lipid moieties. The principal classes of lipids that are encompassed within this 

30 invention are fatty acids and sterols (e.g., cholesterol). Derivatized proteins of the 
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invention contain fatty acids which are cyclic, acyclic (i.e.» straight chain), saturated or 
unsaturated, mono-carboxylic acids. Exemplary saturated fatty acids have the generic 
formula: CHS (CH2)n COON. Table 2 below lists examples of some fatty acids that can 
be derivatized conveniently using conventional chemical methods. 

TABLE 2: Exemplary Saturated and Unsaturated Fatty Acids 



Saturated Acids: CH3 (CH2)n COOH: 

Value of n Common Name 

2 butyric acid 

4 c^roic acid 

6 caprylic acid 

8 capric acid 

10 lauric acid 

12 myristic acid* 

14 palmitic acid* 

16 stearic acid* 

1 8 arachidic acid* 

20 behenic acid 

22 lignoceric acid 

Unsaturated Acids: 

CH3CH=CHCOOH crotonic acid 

CH3(CH2)3CH=CH(CH2)7COOH myristoleic acid* 

CH3(CH2)5CH=CH (CH2)7COOH pahnitoleic acid* 

CH3(CH2)7CH=CH(CH2)7COOH oleic acid* 

CH3(CH2)3(CH2CH=CH)2(CH2)7COOH linoleic acid 

CH3(CH2CH=CH)3(CH2)7COOH linolenic acid 

CH3(CH2)3(CH2CH=CH)4(CH2)3COOH arachidonic acid 



The asterisk (*) denotes fatty acids detected in recombinant hedgehog protein secreted 
from a soluble construct (Pepinsky et al., supra). 
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Other lipids that can be attached to the protein include branched-chain fatty acids 
and those of the phospholipid group such as the phosphatidylmositols (i.e., 
phosphatidylinositol 4-monophosphate and phosphatidylinositol 4,5- biphosphate), 
5 phosphatidycholine, phosphatidylethanolamine, phosphatidylserine, and isoprenoids 
such as famesyl or geranyl groups. Lipid-modified hedgehog proteins can be purified 
from either a natural source, or can be obtained by chemically modifying the soluble, 
unmodified protein. 

For protein purified from a natural source, we showed that when full-length 
10 human Sonic hedgehog (Shh) was expressed in insect cells and membrane-bound Shh 
purified from the detergent-treated cells using a combination of SP-Sepharose 
chromatography and immunoafSnity chromatography, that the purified protein migrated 
on reducing SDS-PAGE gels as a single sharp band with an apparent mass of 20 kDa. 
See PCT The soluble and membrane-boimd Shh proteins were readily distinguishable by 
15 reverse phase HPLC, where the tethered forms eluted later in the acetonitrile gradient. 
We fbsji demonstrated that human Sonic hedgehog is tethered to cell membranes in two 
forms, one form that contains a cholesterol, and therefore is analogous to the data 
reported previously for DrosophUa hedgehog, and a second novel form that contains 
both a cholesterol and a palmitic acid modification. Both modified forms were equally 
20 as active in the C3H10T1/2 alkaline phosphatase assay, but both were about 30-times 
more potent than soluble himian Shh lacking the tether(s). The hydrophobic 
modifications did not significantly affect the apparent binding afSnity of Shh for its 
receptor, patched. 

For specific lipid-modified hedgehog obtained by chemically modifying the 
25 soluble, unmodified protein, palmitic acid and other lipids can be added to soluble Shh 
to create a lipid-modified forms with increased potency in the C3H10T1/2 assay. 
Generally, therefore, the reactive lipid moiety can be in the form of thioesters of 
saturated or unsaturated carboxylic acids such as a Coenzyme A thioesters. Such 
materials and their derivatives may include, for example, commercially available 
30 Coenzyme A derivatives such as palmitoleoyl Coenzyme A, arachidoyl Coenzyme A, 
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aiachidonoyl Coenzyme A, lauioyl Coenzyme A and fhe like. These materials are 
readily available from Sigma Chemical Company (St Louis, MO., 1998 catalog pp. 
303-306). 

There are a wide range of hydrophobic moieties with which hedgehog 

5 polypeptides can be derivatived. A hydrophobic group can be, for example, a relatively 
long chain alkyl or cycloallqrl (preferably n-alkyl) group having approximately 7 to 30 
carbons. The alkyl group may temunate with a hydroxy or primary amine "tail". To 
further illustrate, such molecules include naturally-occurring and synthetic aromatic and 
non-aromatic moieties such as &tty acids, esters and alcohols, other lipid molecules, 

10 cage structures such as adamantane and buckminsterfullerenes, and aromatic 
hydrocarbons such as benzene, perylene, phenanthrene, anthracene, naphthalene, 
pyrene, chrysene, and naphtbacene. 

Particularly useful as hydrophobic molecules are alicyclic hydrocarbons, 
saturated and xmsaturated fatty acids and other lipid and phospholipid moieties, waxes, 

15 cholesterol, isoprenoids, teipenes and polyalicyclic hydrocarbons including adamantane 
and buckminsterfullerenes, vitamins, polyethylene glycol or oligoethylene glycol, (CI- 
C18)-alkyl phosphate diesters, -0-CH2-CH(0H)-0-(C 12-C 18)-alkyl, and in particular 
conjugates with pyrene derivatives. The hydrophobic moiety can be a lipophilic dye 
suitable for use in the invention include, but are not limited to, diphenylhexatriene, Nile 

20 Red, N-phenyl-l-naphthylanune, Prodan, Laurodan, Pyrene, Perylene, rhodamine, 
rhodamine B, tetramethyhhodamine, Texas Red, sulforhodamine, l,r-didodecyl- 
3,3,3',3'tetramethylindocarbocyanine perchlorate, octadecyl rhodamine B and the 
BODIPY dyes available fix>m Molecular Probes Inc. 

Other exemplary lipophiUc moieties include aliphatic carbonyl radical groups 

25 include 1- or 2-adamantylacetyl, 3-methyladamant-l-ylacetyl, 3-methyl-3-bromo-l- 
adamantylacetyl, 1-Klecalinacetyl, camphoracetyl, camphaneacetyl, noradamantylacetyl, 
norbomaneacetyl, bicyclo[2.2.2.] -oct-5-eneacetyl, 1 -methoxybicyclo [2.2.2 .] -oct-5-ene- 
2-carbonyl, cis-5-norbomene-endo-2,3-dicarbonyl, 5-norbomen-2-ylacetyl, (lR)-( - )- 
myrtentaneacetyl, 2-norbomaneacetyl, anti-3-oxo-tricyclo[2.2.1.0^,6> ]-heptane-7- 

30 carbonyl, decanoyl, dodecanoyl, dodecenoyl, tetradecadienoyl, decynoyl or dodecynoyl. 
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1. Chemical ModijBcations of the N-tenninal cysteine of hedgehog 
If an appropriate amino acid is not available at a specific position, site-directed 
mutagmesis can be used to place a reactive amino acid at that site. Reactive amino acids 

5 include cysteine, lysine, histidine, aspartic acid, glutamic acid, serine, threonine, 
tyrosine, arginine, methionine, and tryptophan. Mutagenesis could also be used to place 
the reactive anodno acid at the N- or C-terminus or at an intemal position. 

For example, it is possible to chemically modify an N-temiinal cysteine of a 
biologically active protein, such as a hedgehog protein, or eliminate the N-terminal 

10 cysteine altogether and still retain the protein's biological activity. The rqplacement or 
modification of the N-terminal cysteine of hedgehog with a hydrophobic amino acid 
results in a protein with increased potency in a cell-based signaling assay. By replacing 
the cysteine, this approach eliroinates the problem of suppressing other unwanted 
modifications of the cysteine that can occur during the production, purification, 

15 formulation, and storage of the protein. The generality of this approach is supported by 
the finding that three different hydrophobic amino acids, phenylalanine, isoleucine, and 
methionine, each give a more active form of hedgehog, and thus, an agonist. 

This is also important for conjugation with non-hedgehog moieties (e.g., 
immunoglobulin) as described below in which we introduce two isoleucine residues to 

20 the N-terminal cysteine end of Sonic and Desert hedgehog. This effectively allows us to 
use the thiol of C-terminal cysteine as the reactive site for covalent coupling. Thus, 
replacement of the N-terminal cysteine with any other hydrophobic amino acid should 
result in an active protein. Furthermore, since we have found a correlation between the 
hydrophobicity of an amino acid or chemical modification and the potency of the 

25 corresponding modified protein in the C3HIOT1/2 assay (e.g. Phe > Met, long chain 
length fiitty acids > short chain length), it could be envisioned that adding more than one 
hydrophobic amino acid to the hedgehog sequence would increase the potency of the 
agonist beyond that achieved with a single amino acid addition. Indeed, addition of two 
consecutive isoleucine residues to the N-terminus of human Sonic hedgehog results in 

30 an increase in potency in the C3HIOT1/2 assay as compared to the mutant with only a 

-70- 



wo 01/98344 



PCT/USOl/19435 



single isoleucine added. Thus, adding hydrophobic amino acids at the N- or C-terminus 
of a hedgehog protein, in a surface loop, or some combination of positions would be 
expected to give a more active form of the protein. The substituted amino acid need not 
be one of the 20 conoanon amino acids. Methods have been reported for substituting 
unnatural amino acids at specific sites in proteins and this would be advantageous if the 
amino acid was more hydrophobic in character, resistant to proteolytic attack, or could 
be used to further direct the hedgehog protein to a particular site in vivo that would 
make its activity more potent or specific. Unnatural amino acids can be incorporated at 
specific sites in proteins during in vitro translation, and progress is being reported in 
creating in vivo systems that will allow larger scale production of such modified 
proteins. 

There are many modifications of the N-terminal cysteine which protect the thiol 
and append a hydrophobic moiety. One of skill in the art is capable of determining 
which modification is most appropriate for a particular therapeutic use. Factors affecting 
such a determination include cost and ease of production, purification and formulation, 
solubility, stability, potency, pharmacodynamics and kinetics, safety, immunogenicity, 
and tissue targeting. 

2. Chemical modification of other amino acids. 

There are specific chemical methods for the modification of many other amino 
acids. Therefore, another route for synthesizing a more active form of hedgehog would 
be to chemically attach a hydrophobic moiety to an amino acid in hedgehog other than 
to the N-terminal cysteine. If an appropriate amino acid is not available at the desired 
position, site-directed mutagenesis could be used to place the reactive amino acid at that 
site in the hedgehog structure, \^4lether at the N- or C-terminus or at ano&er positioiL 
Reactive amino acids would include cysteine, lysine, histidine, aspartic acid, glutamic 
acid, serine, threonine, tyrosine, arginine, methionine, and tryptophan. Thus the goal of 
creating a better hedgehog agonist could be attained by many chemical means and we 
do not wish to be restricted by a particular chemistry or site of modification since our 
results support the generality of this approach. 
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The hedgehog polypeptide can be linked to the hydrophobic moiety in a number 
of ways including by chemical coupling means, or by genetic engineering. To illustrate, 
there are a large number of chemical cross-linking agents that are known to those skilled 
in the art. For the present invention, the preferred cross-linking agents are 

5 heterobifunctional cross-linkers, which can be used to link the hedgehog polypeptide 
and hydrophobic moiety in a stepwise manner. Heterobifunctional cross-linkers provide 
the ability to design more specific coupling methods for conjugating to proteins, thereby 
reducing the occurrences of unwanted side reactions such as homo-protein polymers. A 
wide variety of heterobifunctional cross-linkers are known in the art. These include: 

10 succinimidyl 4-(N-maleimidomethyl) cyclohexane- 1-carboxylate (SMCC), m- 
Maleimidobenzoyl-N- hydroxysuccinimide ester (MBS); N-succinimidyl (4-iodoacetyl) 
aminobenzoate (SIAB), succinimidyl 4-(p-maleimidophenyl) butyrate (SMPB), 1-ethyl- 
3-(3-dimetbylaminopropyl) carbodiimide hydrochloride (EDC); 4- 
succinimidyloxycarbonyl- a-methyl-a-(2-pyridyldithio)-tolune (SMPT), N-succinimidyl 

15 3-(2-pyridyldithio) propionate (SPDP), succinimidyl 6-[3-(2-pyridyldithio) propionate] 
hexanoate (LC-SPDP). Those cross-linking agents having N-hydroxysuccinimide 
moieties can be obtained as the N-hydroxysulfosuccinimide analogs, which generally 
have greater water solubility. In addition, those cross-linking agents having disulfide 
bridges within the linking chain can be synthesized instead as the alkyl derivatives so as 

20 to reduce the amount of linker cleavage in vivo. 

One particularly useful class of heterobifunctional cross-linkers, included above, 
contain the primary amine reactive group, N-hydroxysuccinimide (NHS), or its water 
soluble analog N-hydroxysuIfosuccinimide (sulfo-NHS). Primary amines (lysine epsilon 
groups) at alkaline pHTs are improtonated and react by nucleophilic attack on NHS or 

25 sulfo-NHS esters. This reaction results in the formation of an amide bond, and release of 
NHS or sulfo-NHS as a by-product 

Another reactive group useful as part of a heterobifunctional cross-linker is a 
thiol reactive group. Common thiol reactive groiq}S include maleimides, halogens, and 
pyridyl disulfides. Maleimides react specifically with free sulfiiydryls (cysteine 

30 residues) in minutes, under slightly acidic to neutral (pH 6.5-7.5) conditions. Halogens 
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(iodoacetyl functions) react with -SH groups at physiological pITs. Both of these 
reactive groups result in the formation of stable thioether bonds. 

Generally, the structure of an agonistic hedgehog therapeutic useful in this 
invention is a chimeric molecule that has the general formula: X- Y-Z, where wherein X 
is a polypeptide having the amino acid sequence, or portion thereof, consisting of the 
amino acid sequence of hedgehog; Y is an optional linker moiety; and Z is a polypeptide 
comprising at least a portion of a polypeptide other than hedgehog . Preferably, X 
includes at least a biologically active N-terminal fragment of is human Sonic, Indian or 
Desert hedgehog. In the more preferred embodiments, Z is a protein witii an 19-like 
constant and/or variable domain' Most preferably, Z is at least a portion of a constant 
region of an immunoglobxilin and can be derived from an immunoglobulin of the class 
selected from IgM, IgG, IgD, IgA, and IgE. If the class is IgG, then it is selected from 
one of IgG 1, IgG2, IgG3 and IgG4. The constant region of human IgM and IgE contain 
4 constant regions (CHI, (hinge), CH2, CH3 and CH4, whereas the constant region of 
human IgG, IgA and IgD contain 3 constant regions (CHI, (hinge), CH2 and CH3. In the 
most preferred fusion proteins of the invention, the constant region contains at least the 
hinge, CH2 and CH3 domains. 

In another embodiment, the chimeric molecule has the structure D-[Sp]-B-[Sp]- 
C, where D is a non-hedgehog moiety such as described herein; [Sp] is an optional 
spacer peptide sequence; B is a hedgehog protein (which optionally may be a mutein as 
described herein); and C is an optional hydrophobic moiety linked (optionally by way of 
the spacer peptide) to the hedgehog protein D or another residue such as a surface site of 
the protein. 

The present invention provides for multimeric hedgehog therapeutic molecules. 
Such multimers may be generated by using those Fc regions, or portions thereof, of Ig 
molecules which are usually multivalent such as IgM pentamers or IgA dimers. It is 
understood that a J chain polypeptide may be needed to form and stabilize IgM 
pentamers and IgA dimers. Alternatively, multimers of hedgehog therapeutic proteins 
may be formed using a protein witii an afBnity for the Fc region of Ig molecules, such as 
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Protein A. For instance, a plurality of hedgehog / imnninoglobulin fusion proteins may 
be bound to Protein A-agarose beads. 

These multivalent forms are useful since they possess multiple hedgehog 
receptor binding sites. For example, a bivalent soluble hedgehog therapeutic may consist 

5 of two tandem repeats of those amino acids encoded by nucleic acids of SEQ. ID NOS: 
1-9 or 21, 22 or 27 (moiety X in the generic formula) separated by a linker region 
(moiety Y), the repeats boimd to at least a portion of an immunoglobulin constant 
domain (moiety Z). Alternate polyvalent forms may also be constructed, for example, by 
chemically coiq}ling chimeric hedgehog therapeutics of the invention to any clinically 

10 acceptable carrier molecule, a polymer selected fix>m the group consisting of FicoU, 
polyethylene glycol or dextran using conventional coupling techniques. Alternatively, 
hedgehog may be chemically coupled to biotin, and the biotin-hedgehog chimera then 
allowed to bind to avidin, resulting in tetravalent avidin/biotin/hedgehog molecules. 
Chimeric hedgehog proteins may also be covalently coupled to dinitrophenol (DNP) or 

15 trinitrophenol (TNP) and the resulting conjugate precipitated with anti-DNP or anti- 
TNP-IgM, to form decameric conjugates with a valency of 10 for hedgehog receptor 
binding sites 

Polymer Conjugates of Hedgehog Thempeutics 

20 One unique property of polyalkylene glycol-derived polymers of value for 

therapeutic applications of the present invention is their general biocompatibility. These 
polymers have various water solubility properties and are not toxic. They are believed 
non-immunogenic and non-antigenic and do not interfere with the biological activities 
of the hedgehog protein moiety vsdien conjugated imder the conditions described herein. 

25 They have long circulation in the blood and are easily excreted from living organisms. 

Hedgehog therapeutics are conjugated most preferably via a terminal reactive 
group on the polyalkylene glycol polymer although conjugations can also be branched 
from non-terminal reactive groups. The polymer with the reactive groiq)(s) is designated 
herein as "activated polymer". The reactive group would be expected to selectively react 

30 with free amino or other reactive groups on the hedgehog protein. In theory, the 
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activated polymer(s) are reacted so that attachment could occur at any available 
hedgehog amino group such as alpha amino groups or the epsilon-amino groups of 
lysines, or -SH groups of cysteines. Free carboxylic groups, suitably activated carbonyl 
groups, hydroxyl, guanidyl, oxidized carbohydrate moieties and mercapto groups of the 

5 hedgehog protein (if available) can also be used as attachment sites. 

In particular, the chemical modification of any N-terminal cysteine to protect the 
thiol, with concomitant conjugation witii a polyalkylene glycol moiety (i.e., PEG), can 
be carried out in numerous ways by someone skilled m the art. See United States Patent 
4,179,337. The sulfhydryl moiety, with the thiolate ion as the active species, is the most 

10 reactive functional group in a protein. There are many reagents that react &ster with the 
thiol than any other groups. See Chemistry of Protein Conjugation and Cross-Linking 
(S. S. Wong, CRC Press, Boca Raton, FL, 1991). The thiol of an N-terminal cysteine, 
such as found in all hedgehog proteins, would be expected to be more reactive than 
internal cysteines within the sequence. This is because the close proximity to tiie alpha- 

15 amine will lower the pKa of the thiol resulting in a greater degree of proton dissociation 
to the reactive thiolate ion at neutral or acid pH. In addition, the cysteine at the N- 
terminus of the structure is more likely to be exposed than the other two cysteines in the 
hedgehog sequence that are found buried in the protein structure. 

Other examples of methods that provide linkage between a polyallqrlene glycol 

20 and the N-terminal cysteine would be reactions with other alpha-haloacetyl compounds, 
organomercurials, disulfide reagents, and other N-substituted maleimides. Nvimerous 
derivatives of these active species are available commercially (e.g., ethyl iodoacetate 
(Aldrich, Milwaukee WI), phenyl disulfide (Aldrich), and N-pyrenemaleimide 
(Molecular Probes, Eugene OR)) or could be synthesized readily (e.g., N- 

25 alkyliodoacetamides, N-alkylmaleimides, and organomercurials). Another aspect to the 
reactivity of an N-traninal cysteine is that it can take part in reaction chemistries unique 
to its 1,2-aminothiol configuration. One example is the reaction with thioester groups to 
form an N-terminal amide group via a rapid S to N shift of the thioester. This reaction 
chemistry can covqple together syn&etic peptides and can be used to add single or 

30 multiple, natural or unnatural, anaino acids or other hydrophobic groups via the 
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appropriately activated peptide. Another example, is the reaction with aldehydes to form 
the thiazolidine adduct. Numerous hydrophobic derivatives of thiol esters (e.g., C2-C24 
saturated and unsaturated fatty acyl Coenzyme A esters (Sigma Chemical Co., St. Louis 
MO)), aldehydes (e.g., butyraldehyde, n-decyl aldehyde, and n-myristyl aldehyde 

5 (Aldrich)), and ketones (e.g., 2-, 3-, and 4-decanone (Aldrich)) are available 
commercially or could be synthesized readily. In a similar manner, thiomorpholine 
could be prepared from a variety of alpha-haloketone starting materials. 

Several observations suggest that the C-teraiinus or amino acids near the C- 
traninus would be preferred targets for modification with a polyalkylene glycol moiety. 

10 Briefly, we have shown that: (i) The wild-type protein is naturally modified with 
cholesterol at the C-tenninus, indicating that it is exposed and available for 
modification. Indeed, we showed that treatment with thrombin results in selective 
release of the C-tenninal 3 ammo acids (See U.S.S.N. 60/106,703, filed 11/2/98, now 
PCX Number -incorporated herein by reference); (ii) We performed extensive SAR 

IS analyses and discovered that the C-terminal 11 amino acids could be deleted without 
harmfiil effects on folding or function; (iii) We have made hedgehog/Ig fusion proteins 
by attaching an Ig moiety to the C-terminus of hedgehog without harmful effects on 
folding or fimction (data not presented here). 

While th^ is no simple chemical strategy for targeting a polyalkylene glycol 

20 polymer such as PEG to the C-tmninus of hedgehog, it is straightforward to genetically 
engineer a site that can be used to target the polymer moiety, as discussed above with 
regard to site-directed mutagenesis. For example, incorporation of a Cys at a site that is 
at or near the C-temainus allows specific modification using a maleimide, vinylsulfone 
or haloacetate- activated polyalkylene glycol (e.g., PEG). As discussed above in Section 

25 A, these derivatives can be used specifically for modification of the engineered C- 
terminal cysteines due to the hig|i selectively of these reagents for Cys. Other strategies 
such as incorporation of a histidine tag which can be targeted (Fancy et al., (1996) 
Chem. & Biol. 3: SSI) or an additional glycosyladon site, represent other alternatives 
for modifying the C-terminus of hedgehog. A single polymer molecule may be 

30 employed for conjugation with the hedgehog protein and modified versions thereof as 
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discussed above, althou^ it is also contemplated that more than one polymer molecule 
can be attached as well. Conjugated hedgehog compositions of the invention may fibad 
iitility in both in vivo as well as non-in vivo applications. Additionally, it will be 
recognized that the conjugating polymer may utilize any other groups, moieties, or other 

5 conjugated species, as appropriate to the end use application. By way of example, it may 
be useful in some applications to covalently bond to the polymer a functional moiety 
imparting UV-degradation resistance, or antioxidation, or other properties or 
characteristics to the polymer. As a further example, it may be advantageous in some 
applications to functionalize the polymer to render it reactive or cross-linkable in 

10 character, to enhance various properties or characterisics of the overall conjugated 
material. Accordingly, the polymer may contain any functionality, repeating groups, 
linkages, or other constitutent structures which do not preclude the efiScacy of the 
conjugated hedgehog composition for its intended purpose. Other objectives and 
advantages of the present invention will be more fully apparent from the ensuing 

15 disclosure and appended claims. 

Illustrative polymers that may usefully be employed to achieve these desirable 
characteristics are described herein below in exemplary reaction schemes. In covalently 
bonded peptide applications, the polymer may be functionalized and then coupled to 
free anuno acid(s) of the peptide(s) to form labile bonds. 

20 Generally from about 1.0 to about 10 moles of activated polymer per mole of 

protein is employed, depending on the particular reaction chemistry and the protein 
concentration. The final amount is a balance between maximizing the extent of the 
reaction while minimizing non-specific modifications of the product and, at the same 
time, defining chemistries that will maintain optimiun activity, while at the same time 

25 optimizing, if possible, the half-life of the protein. Preferably, at least about 50% of the 
biological activity of the protein is retained, and most preferably 100% is retained. 

The reactions may take place by any suitable method xised for reacting 
biologically active materials with inert polymers. Generally the process involves 
preparing an activated polymer (that may have at least one terminal hydroxyl groiq)) and 

30 thereafter reacting the protein with the activated polymer to produce the soluble protein 
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suitable for formulation. The above modification reaction can be perfoimed by several 
methods, which may involve one or more steps. 

Suitable methods of attachiag a polyalkylene glycol moiety to a C-temiinal 
cysteine involve using such moieties tiiat are activated with a thiol reactive group, as 

5 generally discussed above. Common thiol reactive groups include maleimides, 
vinylsulfones or haloacetates. These derivatives can be used specifically for 
modification of cysteines due to the high selectively of these reagents for -SH. 
Maleimides react specifically with free sulfliydryls (cysteine residues) in minutes, xmder 
slightly acidic to neutral (pH 6.0-7.5) conditions. This pH range is preferred although 

10 the reaction will proceed, albeit slowly, at pH 5,0, Halogens (iodoacetyl fimctions) react 
with -SH groups at physiological pHs to slightly basic conditions. Both of these reactive 
groups result in the formation of stable thioether bonds. 

In the practice of the methods of the present invention, polyalkylene glycol 
residues of C1-C4 alkyl polyalkylene glycols, preferably polyethylene glycol (PEG), or 

15 poly(oxy)alkylene glycol residues of such glycols are advantageoxisly incorporated in 
the polymer systems of interest Thus, the polymer to which the protein is attached can 
be a homopolymer of polyethylene glycol (PEG) or is a polyoxyethylated polyol, 
provided in all cases that the polymer is soluble in water at room temperature. Non- 
limiting examples of such polymers include polyall^lene oxide homopolymers such as 

20 PEG or polypropylene glycols, polyoxyethylenated glycols, copolymers thereof and 
block copolymers thereof, provided that the water solubility of the block copolymer is 
maintained. Examples of polyoxyethylated polyols include, for example, 
polyoxyethylated glycerol, polyoxyethylated sorbitol, polyoxyethylated glucose, or the. 
like. The glycerol backbone of polyoxyethylated glycerol is the same backbone 

25 occurring naturally in, for example, animals and humans in mono-, di-, and 
triglycerides. Therefore, this branching would not necessarily be seen as a foreign agent 
in the body. 

As an alternative to polyalkylene oxides, dextran, polyvinyl pyrrolidones, 
polyacrylamides, polyvinyl alcohols, carbohydrate-based polymers and the like may be 
30 used. Moreover, heteropolymers (i.e., polymers consisting of more than one species of 
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monomer such as a copolymer) as described in U.S. Patent 5,359,030 may be used (e.g., 
proteins conjugated to polymers comprising a polyalkylene glycol moiety and one or 
more fetty acids) Those of ordinary skill in the art will recognize that the foregoing list 
is merely illustrative and that all polymer materials having the qualities described herein 

5 are contemplated. The polymer need not have any particular molecular weight, but it is 
preferred that the molecular weight be between about 300 and 100,000, more preferably 
between 10,000 and 40,000. In particular, sizes of 20,000 or more are best at preventing 
protein loss due to filtration in the kidneys. Moreover, in another aspect of the invention, 
one can utilize hedgehog covalently bonded to the polymer component in which the 

10 nature of the conjugation involves cleavable covalent chemical bonds. This allows for 
control in terms of the time course over which the polymer may be cleaved fi-om the 
hedgehog. This covalent bond between the hedgehog protein drug and the polymer may 
be cleaved by chemical or en2ymatic reaction. The polymer-hedgehog protein product 
retains an acceptable amount of activity. Concurrently, portions of polyethylene glycol 

15 are present in the conjugating polymer to endow the polymer-hedgehog protein 
conjugate with high aqueous solubility and prolonged blood circulation capabiUty. As a 
restilt of these improved characteristics the invention contemplates parenteral, aerosol, 
and oral delivery of both the active polymer-hedgehog protein species and, following 
hydrolytic cleavage, bioavailability of the hedgehog protein per se, in in vivo 

20 applications. 

It is to be understood that the reaction schemes described herein are provided for 
the purposes of illustration only and are not to be limiting with respect to the reactions 
and structures which may be utilized in the modification of the hedgehog protein, e.g., 
to achieve solubility, stabilization, and cell membrane affinity for parenteral and oral 

25 administration. Generally speaking, the concentrations of reagents used are not critical 
to carrying out the procedures provided hererin except that Ihe molar amount of 
activated polymer should be at least equal to, and preferably in excess of, the molar 
amount of the reactive group (e.g., thiol) on the hedgehog amino acid(s). The reaction of 
the polymer with the hedgehog to obtain the most prefetred conjugated products is 

30 readily carried out using a wide variety of reaction schemes. The activity and stability of 
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the hedgehog protein conjugates can he varied in several ways, by using a polymer of 
diflferent molecular size. Solubilities of the conjugates can be varied by changing the 
proportion and size of the polyethylene glycol fragment incorporated in the polymer 
composition. 

3. SxnaJl Molecule Agonists 

In other embodiments, a hedgehog agonist may be a small organic molecule. 
Such a small organic molecule may agonize hedgehog signal transduction via an 
interaction with but not limited to hedgehog, patched (ptc\ glU and/or smoothened. It is, 
therefore, specifically contemplated that these small molecules which enhance or 
potentiate aspects of hedgehog, ptc, or smoothened signal transduction will likewise be 
capable of enhancing angiogenesis (or other biological consequences) in nomial cells 
and/or mutant cells. Thus, it is contemplated that in certain embodiments, these 
compounds may be useful for enhancing or potentiating hedgehog activity. In other 
embodiments, these compounds may be useful for inhibitng hedgehog activity in 
abnormal cells. In preferred embodiments, the subject agonists are organic molecules 
having a molecular weight less than 2500 amu, more preferably less than 1500 amu, and 
even more preferably less than 750 amu, and are capable of agonizing hedgehog 
signaling, preferably specifically in target cells. 

For example, agonist compounds useful in the subject methods include 
compoxmds represented by general formula (XII); 
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FonnulaXn 

wherein, as valence and stability permit. 

At and Ar' independently represent substituted or unsubstituted aryl or 
heteroaryl rings; 

Y, independently for each occurrence, may be absent or represent -N(R)-, -0-, - 
S-, or -Se-; 

X can be selected from -C(-0)-, -C(=S)-, -S(02)-, -8(0)-, -C(=NCN)-, - 
P(=0)(0R2)-, and a methylene group optionally substituted with 1-2 groups such as 

lower alkyl, alkenyl, or alkynyl groups; 

M represents, independently for each occurrence, a substituted or unsubstituted 
methylene group, such as -CH2-, -CHF-, -CHOH-, -CH(Me)-, -C(=0)-, etc., or two M 
taken together represent substituted or unsubstituted ethene or ethyne; 

R represents, independently for each occurrence, H or substituted or 
unsubstituted aryl, heterocyclyl, heteroaryl, aralkyl, heteroaraUcyl, alkynyl, alkenyl, or 
alkyl, or two R taken together may form a 4- to 8-membered rii^, e.g., with N; 

Cy and Cy* independenly represent substituted or unsubstituted aryl, 
heterocyclyl, heteroaryl, or cycloalkyl, including polycycUc groiq)s; 

i represents, independently for each occurrence, an integer from 0 to S, 
preferably from 0 to 2; and 
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n, individually for each occurence, represents an integer &om 0 to 10, preferably 
from 0 to 5. 

In certain embodiments, M represents, independently for each occurrence, a 
5 substituted or imsubstituted methylene group, such as "CH2-, -CHF-, -CHOH-, - 
CH(Me>, -C(-0)-, etc. 

In certain embodiments, Ar and Ar' represent phenyl rings, e.g., unsubstituted or 
substituted with one or more groups including heteroatoms such as O, N, and S. In 
certain embodiments, at least one of Ar and Ar' represents a phenyl ring. In certain 
10 embodiments, at least one of Ar and Ar' represents a heteroaryl ring, e.g., a pyridyl, 
thiazolyl, thienyl, pyrimidyl, etc. In certain embodiments, Y and Ar* are attached to Ar 
in a meta and/or 1,3-relationship. 

In certain embodiments, Y is absent from all positions. In embodiments wherein 
Y is present in a position, i preferably represents an integer from 1-2 in an adjacent M\ if 
15 i=0 would result in two occurrences of Y being directly attached, or an occurrence of Y 
being directly attached to N. 

In certain embodiments, Cy' is a substituted or unsubstituted aryl or heteroaryl. 
In certain embodiments, Cy' is directly attached to X. In certain embodiments, Cy' is a 
substituted or unsubstituted bicyclic or heteroaryl ring, preferably both bicyclic and 
20 heteroaryl, such as benzotiiiophene, benzofuran, benzopyrrole, benzopyridine, etc. In 
certain embodiments, Cy* is a monocycUc aryl or heteroaryl ring substituted at least 
with a substituted or unsubstituted aryl or heteroaryl ring, e.g., forming a biaryl system. 
In certain embodiments, Cy' includes two substituted or unsubstituted aryl or heteroaryl 
rings, e.g., the same or different, directly connected by one or more bonds, e.g., to form 
25 a biaryl or bicyclic ring system. 

In certain embodiments, X is selected from -C(=0)-, -C(=S)-, and -8(02)-. 
In certain embodiments, Cy represents a substituted or unsubstituted non- 
aromatic carbocyclic or heterocyclic ring, i.e., including at least one sp hybridized 
atom, and preferably a plurality of sp^ hybridized atoms. In certain embodiments, Cy 
30 includes an amine within the atoms of the ring or on a substitutent of the ring, e.g., Cy is 
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pyridyl, imidazolyl, pyrrolyl, piperidyl, pyrrolidyl, piperazyl, etc., and/or bears an amino 
substituent. . In certain embodiments, Cy is a 5- to 7-membered ring. In certain 
embodiments, Cy is directly attached to N. In embodiments wherein Cy is a six- 
membeied ring directly attached to N and bears an amino substituent at the 4 position of 
the ring relative to N, the N and amine substituents may be disposed trans on the ring. 

In certain embodiments, substitu^ts on Ar or Ar' are selected £ix)m halogen, 
lower alkyl, lower alkenyl, aryl, heteroaryl, carbonyl, thiocarbonyl, ketone, aldehyde, 
amino, acylamino, cyano, nitro, hydroxyl, azido, sulfonyl, sulfoxido, sulfete, sulfonate, 
sulfamoyl, sulfonamide, phosphoryl, phosphonate, phosphinate, -(CH2)palkyl, *- 
(CH2)palkenyl, .(CH2)palkynyl, -(CH2)paryl, -(CH2)paralkyl, -(CH2)pOH, -(CH2)pO- 
lower alkyl, -(CH2)pO-lower alkenyl, -0(CH2)nR, -(CH2)pSH, "(CH2)pS-lower allqrl, - 
(CH2)pS-lower alkenyl, -S(CH2)nR, -(CH2)pN(R)2, -(CH2)pNR-lower alkyl, - 
(CH2)pNR-lower alkenyl, -NR(CH2)nR» and protected forms of the above, wherein p, 
individually for each occurence, represents an integer from 0 to 10, preferably from 0 to 
5, 

In certain embodiments, compounds useful in the present invention may be 
represented by general formula (Xni): 
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At* 




Formula Xm 

wherein, as valence and stability permit, 

Ar and Ar' independently represent substituted or unsubstituted aryl or 
heteroaiyl rings; 

5 Y, independently for each occurrence, may be absent or represent -N(R)-, -0-, - 

S-, or -Se-; 

X can be selected jfrom -C(=0)-, -C(=S)-, -S(02)-, -S(0)-, -C(=NCN)-, - 
P(=0)(0R2)-, and a methylene group optionally substituted with 1-2 groups such as 
lower alkyl, alkenyl, or alkynyl groups; 
10 M represents, independendy for each occurrence, a substituted or imsubstituted 

metiiylene group, sudi as -CH2-, -CHF-, -CHOH-, -CH(Me)-, -C(=0)-, etc., or two M 
taken together represent substituted or unsubstituted etitiene or ethyne, wherein some or 
all occurrences of M in Mj form all or part of a cyclic structure; 

R represents, independently for each occurrence, H or substituted or 
15 imsubstituted aryl, heterocyclyl, heteroaryl, aralkyl, heteroaralkyl, alkynyl, alkenyl, or 
alkyl, or two R taken together may form a 4- to 8-membered ring, e.g., with N; 

Cy' represents a substituted or unsubstituted aryl, heterocyclyl, heteroaryl, or 
cycloalkyl, including polycyclic groiq)s; 

j represents, independently for each occurrence, an integer from 0 to 10, 
20 preferably from 2 to 7; 
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i represents, independently for each occurrence, an integer from 0 to S, 
preferably from 0 to 2; and 

n, individually for each occurence, represents an integer from 0 to 10, preferably 
from 0 to 5. 

5 

In certain embodiments, M represents, independently for each occurrence, a 
substituted or unsubstituted methylene group, such as -CH2-, -CHF-, -CHOH-, - 
CH(Me)., -C(=0)-, etc. 

In certain embodiments, Ar and Ar' represent phenyl rings, e.g., unsubstituted or 

10 substituted with one or more groups including heteroatoms such as O, N, and S. In 
certain embodiments, at least one of Ar and Ar' represents a phenyl ring. In certain 
embodiments, at least one of Ar and Ar' represents a heteroaryl ring, e.g., a pyridyl, 
thiazolyl, ttdenyl, pyrimidyl, etc. In certain embodiments, Y and Ar' are attached to Ar 
in a meta and/or 1,3 -relationship. 

15 In certain embodiments, Y is absent from all positions. In embodiments wherein 

Y is present in a position, i preferably represents an integer from 1-2 in an adjacent Mi if 
i=0 would result in two occurrences of Y being directiy attached, or an occurrence of Y 
being directly attached to N or NR2- 

In certain embodiments, Cy' is a substituted or unsubstituted aryl or heteroaryl. 

20 In certain embodiments, Cy' is directly attached to X. In certain embodiments, Cy' is a 
substituted or unsubstituted bicyclic or heteroaryl ring, preferably both bicyclic and 
heteroaryl, such as benzothiophene, benzofuran, benzopyrrole, benzopyridine, etc. In 
certain embodiments, Cy' is a monocycUc aryl or heteroaryl ring substituted at least 
with a substituted or unsubstituted aryl or heteroaryl ring, e.g., forming a biaryl system. 

25 In certain embodiments, Cy' includes two substituted or unsubstituted aryl or heteroaryl 
riugs, e.g., the sarnie or different, directly coimected by one or more bonds, e.g., to form 
a biaryl or bicyclic ring system. 

In certain embodiments, X is selected from -C(=0)-, -C(=S)-, and -8(02)-. 
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In certain embodiments, NR2 represents a primary amine or a secondary or 
tertiary amine substituted with one or two lower alkyl groups, aryl groups, or aralkyl 
groups, respectively, preferably a primary amine. 

In certain embodiments, substituents on Ar or Ar' are selected &om halogen, 
lower alkyl, lower alkenyl, aryl, heteroaryl, carbonyl, thiocarbonyl, ketone, aldehyde, 
amino, acylamino, cyano, nitro, hydroxyl, azido, sulfonyl, sulfoxido, sulfate, sulfonate, 
sulfamoyl, sulfonamido, phosphoryl, phosphonate, phosphinate, -(CH2)palkyl, - 
(CH2)palkenyl, .(CH2)palkynyl, -(CH2)paryl, .(CH2)paralkyl, -(CH2)pOH, -(CH2)pO- 
lower alkyl, -(CH2)pO-lower alkenyl, -0(CH2)nR, -(CH2)pSH, -(CH2)pS-lower alkyl, - 
(CH2)pS-lower alkenyl, -S(CH2)nR, -(CH2)pN(R)2, "(CH2)pNR-lower alkyl, - 
(CH2)pNR-lower alkenyl, -lSfR(CH2)nR, and protected forms of the above, wherein p, 
individually for each occurence, represents an integer from 0 to 10, preferably from 0 to 
5. 

In certain embodiments, compounds usefiil in the present invention may be 
represented by general formula POV): 




Ml-v^ ^^^-^ 

^Y^ ^NRa 




Formula XIV 

wherein, as valence and stability permit, 

Ar and Ar* independently represent substituted or imsubstituted aryl or 
heteroaryl rings; 
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Y, independently for each occurrence, may be absent or represent -N(R)-, -0-, - 
S-,or-Ses 

X can be selected from -C(=0)-, -C(=S)-, -S(02)-, -S(0)-, -C(=NCN)-, - 
P(=0)(0R2)-, and a methylene group optionally substituted with 1-2 groups such as 

5 lower alkyl, alkenyl, or alkynyl groups; 

M represents, independently for each occurrence, a substituted or imsubstituted 
methylene group, such as -CH2-, -CHF-, -CHOH-, -CH(Me)-, -C(=0)-, etc., or two M 
taken together represent substituted or unsubstituted ethene or ethyne; 

R represents, independently for each occurrence, H or substituted or 
10 unsubstituted aryl, heterocyclyl, heteroaryl, aralkyl, heteroaralkyl, alkynyl, alkenyl, or 
alkyl, or two R taken together may form a 4- to 8-membered ring, e.g., with N; 

Cy and Cy' independenly represent substituted or unsubstituted aryl, 
heterocyclyl, heteroaryl, or cycloalkyl, including polycyclic groups; 

i represents, independentiy for each occurrence, an integer from 0 to 5, 
15 preferably from 0 to 2; and 

n, individually for each occurence, represents an integer from 0 to 10, preferably 
from 0 to 5. 

In certain embodiments, M represents, independentiy for each occurrence, a 
20 substituted or unsubstituted methylene group, such as -CH2-, -CHF-, -CHOH-, - 
CH(Me)-, -C(=0)-, etc. 

In certain embodiments, Ar and Ar' represent phenyl rings, e.g., unsubstituted or 
substituted with one or more groups including heteroatoms such as O, N, and S. In 
cotain embodiments, at least one of Ar and Ar' represents a phenyl ring. In certain 
25 embodiments, at least one of Ar and Ar' represents a heteroaryl ring, e.g., a pyridyl, 
thiazolyl, thienyl, pyrimidyl, etc. In certain embodiments, Y and Ar' are attached to Ar 
in a meta and/or 1,3-relationship. 

In certain embodiments, Y is absent from all positions. In embodiments wherein 
Y is present in a position, i prefarably represents an integer from 1-2 in an adjacent Mi if 
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i=0 would result in two occurreiices of Y being directly attached, or an occurrence of Y 

being directly attached to N or NR2. 

In certain embodiments, Cy' is a substituted or unsubstituted aryl or heteroaryl. 

In certain embodiments, Cy* is directly attached to X. In certain embodiments, Cy' is a 
5 substituted or unsubstituted bicyclic or heteroaryl ring, preferably both bicyclic and 

heteroaryl, such as benzotfaiophene, benzofuran, ben2X)pycrole, benzopyridine, etc. In 

certain embodiments, Cy* is a monocyclic aryl or heteroaryl ring substituted at least 

with a substituted or unsubstituted aryl or heteroaryl ring, e.g., forming a biaryl system. 

In certain embodiments, Cy' includes two substituted or unsubstituted aryl or heteroaryl 
10 rings, e.g., the same or different, directly connected by one or more bonds, e.g., to form 

a biaryl or bicyclic ring system. 

In certain embodiments, X is selected from "C(=0)-, -C(=S)-, and -S(02)-. 

In certain embodiments, NR2 represents a primary amine or a secondary or 

tertiary amine substituted with one or two lower alkyl groups, aryl groups, or aralkyl 

15 groups, respectively, preferably a primary amine. 

In certain embodiments, Cy represents a substituted or unsubstituted non- 

3 * * 

aromatic carbocyclic or heterocyclic ring, i.e., including at least one sp hybridized 
atom, and preferably a plurality of sp^ hybridized atoms. In certain embodiments, Cy is 
directly attached to N and/or to NR2- In certain embodiments, Cy is a 5- to 7-membered 

20 ring. In embodiments wherein Cy is a six-membered ring directly attached to N and 
bears an amino substituent at the 4 position of the ring relative to N, the N and amine 
substituents may be disposed trans on &e ring. 

In certain embodiments, substituents on Ar or Ar' are selected from halogen, 
lower alkyl, lower alkenyl, aryl, heteroaryl, carbonyl, thiocarbonyl, ketone, aldehyde, 

25 amino, acylamino, cyano, nitro, hydroxyl, azido, sulfonyl, sulfoxide, sulfiite, sulfonate, 
sulfamoyl, sulfonamido, phosphoryl, phosphonate, phosphinate, -(CH2)palkyl, - 
(CH2)palkenyl, -(CH2)palkynyl, -(CH2)paiyl, .(CH2)paralkyl, -(CH2)pOH, -(CH2)pO- 
lower alkyl, -(CH2)pO-lower alkenyl, -0(CH2)nR, -(CH2)pSH, -(CH2)pS-lower alkyl, - 
(CH2)pS--lower alkenyl, .S(CH2)nR. -(CH2)pN(R)2, <CH2)pNR-lower alkyl, - 

30 (CH2)pNR-lower alkenyl, -NR(CH2)nR» and protected forms of the above, wherein p, 
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individually for each occurence, represents an intego: from 0 to 10, preferably from 0 to 
5. 

In certain embodiments, compoimds usefril in the subject methods include 
compounds represented by general forumla (XV): 




Cy' 

Formula XV 

wherein, as valence and stability permit, 

Cy' represents a substituted or unsubstituted aryl or heteroaryl ring, including 
polycyclics; 

Y, independently for each occurrence, may be absent or represent -N(R)-, -0-, - 
S-, or -Se-; 

X can be selected from -C(=0)-, "C(=S)-, -S(02)-, -S(0)-, -C(=NCN)-, - 
P(=0)(0R2)-, and a methylene group optionally substituted with 1-2 groups such as 
lower alkyl, alkenyl, or alkynyl groups; 

M represents, independently for each occurrence, a substituted or unsubstituted 
methylene group, such as -CH2-, -CHF-, -CHOH-, -CH(Me)-, -C(=0)-, etc., or two M 
taken togeflier repiesent substituted or imsubstituted ethene or e&yne; 

R represents, independently for each occurrence, H or substituted or 
imsubstituted aryl, heterocyclyl, heteroaryl, arallqrl, heteroaralkyl, alkynyl, alkenyl, or 
alkyl, or two R taken together may form a 4- to 8-membered ring, e.g., with N; 
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Ri and R2 represent, independently and as valency permits, from 0-S substituents 
on the ring to which it is attached, selected from halogen, lower alkyl, lower alkenyl, 
aryl, heteroaryl, carbonyl, thiocarbonyl, ketone, aldehyde, amino, acylamino, amido, 
amidino, cyano, nitro, hydroxyl, azido, sulfonyl, sulfoxido, sulfate, sulfonate, sulfamoyl, 
sulfonamide, phosphoryl, phosphonate, phosphinate, -(CH2)palkyl, -(CH2)palkenyl, - 
(CH2)palkynyl, -(CH2)paryl, -(CH2)paralkyl, -(CH2)pOH, -(CH2)pO-lower alkyl, - 
(CH2)pO-lower alkenyl, -0(CH2)nR, -(CH2)pSH, -(CH2)pS-lower alkyl, -(CH2)pS- 
lower alkenyl, -S(CH2)nR, -(CH2)pN(R)2, -(CH2)pNR-lower alkyl, -(CH2)pNR4ower 
alkenyl, -NR(CH2)jaR3 and protected forms of the above; 

Cy represents substituted or unsubstituted aryl, heterocyclyl, heteroaryl, or 
cycloalkyl, including polycyclic groups; 

i represents, independently for each occittrence, an integer from 0 to 5, 
preferably from 0 to 2; and 

p and n, individually for each occurrence, represent integers from 0 to 10, 
preferably from 0 to 5. 

In certain embodiments, M represents, independently for each occurrence, a 
substituted or unsubstituted methylene group, such as -CH2-, -CHF-, -CHOH-, - 
CH(Me)-, -C(=0)«, etc. 

In certain embodiments, Cy' represents a substituted or imsubstituted bicyclic or 
heterocyclic ring system, preferably both bicyclic and heteroaryl, such as 
benzothiophene, benzofturan, benzopyrrole, benzopyridine, etc. In certain embodiments, 
Cy* is directly attached to X. In certain embodiments, Cy* is a monocyclic aryl or 
heteroaryl ring substituted at least with a substituted or unsubstituted aryl or heteroaryl 
ring, e.g., forming a biaryl system. In certain embodiments, Cy' includes two substituted 
or imsubstituted aryl or heteroaryl rings, e.g., the same or different, directly connected 
by one or more bonds, e.g., to form a biaryl or bicyclic ring system. 

In certain embodiments, Y is absent from all positions. In embodiments wherein 
Y is present in a position, i preferably represents an integer from 1-2 in an adjacent Mi if 
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i=0 would result in two occuirences of Y being directly attached, or an occurrence of Y 
being directly attached to N. 

In certain embodiments, X is selected from -C(=0)-, -C(=S)-, and -8(02)-. 

In certain embodiments, Cy represents a substituted or imsubstituted non- 

5 aromatic carbocyclic or heterocyclic ring, i.e., including at leeist one sp hybridized 
atom, and preferably a plurality of sp^ hybridized atoms. In certain embodiments, Cy 
includes an amine within the atoms of the ring or on a substitutent of the ring, e.g., Cy is 
pyridyl, imidazolyl, pyrrolyl, piperidyl, pyrrolidyl, piperazyl, etc., and/or bears an amino 
substituent. In certain embodiments, Cy is directly attached to N. In certain 

10 embodiments, Cy is a 5- to 7-membered ring. In embodiments wherein Cy is a six- 
membered ring directly attached to N and bears an amino substituent at the 4 position of 
the ring relative to N, the N and amine substituents may be disposed trans on the ring. 

In certain embodiments, Ri and R2 represent^ independently and as valency 
permits, from 0-5 substituents on the ring to which it is attached, selected from halogen, 

15 lower alkyl, lower alkenyl, carbonyl, thiocarbonyl, ketone, aldehyde, amino, acylamino, 
cyano, nitro, hydroxyl, azido, sulfonyl, sulfoxide, siilfate, sulfonate, sulfamoyl, 
sulfonamido, phosphoryl, phosphonate, phosphinate, -(CH2)palkyl, -(CH2)palkenyl, - 
(CH2)palkynyl, -(CH2)paryl, -(CH2)paralkyl, -(CH2)pOH, -(CH2)pO-lower alkyl; - 
(CH2)pO-lower alkenyl, «.0(CH2)nR, -(CH2)pSH, -(CH2)pS-lower alkyl, -(CH2)pS- 

20 lower alkenyl, -S(CH2)nR> -(CH2)pN(R)2, -(CH2)pNR-lower alkyl, -(CH2)pNR-lower 
alkenyl, -NR(CH2)nR, and protected forms of the above, wherein p, individually for 
each occurence, represents an integer from 0 to 10, preferably from 0 to 5. 

In certain embodiments, compounds useful in the present invention may be 
25 represented by general formula CXVI): 
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wh^ein, as valence and stability pennit, 

Cy' represents a substituted or unsubstituted aryl or heteroaryl ring, including 
polycyclics; 

Y, independently for each occurrence, may be absent or represent -N(R)-, -0-, - 
or -Se-; 

X can be selected from -C(=OK -C(=S)-, -8(02)-. -S(0)-, -C(=NCN)-, - 
P(=0)(0R2)-, and a methylene group optionally substituted with 1-2 groups such as 

lower alkyl, alkenyl, or aUqmyl groups; 

M represents, independently for each occurrence, a substituted or xmsubstituted 
methylene group, such as -CH2-, -CHF-, -CHOH-, -CH(Me)-, -C(=0)-, etc., or two M 
taken together represent substituted or unsubstituted ethene or ethyne; 

R represents, independently for each occurrence, H or substituted or 
unsubstituted aryl, heterocyclyl, heteroaryl, aralkyl, heteroaralkyl, alkynyl, alkenyl, or 
alkyl, or two R taken together may form a 4- to 8-membered ring, e.g., with N; 

Ri and R2 represent, independently and as valency permits, from 0-S substituents 
on the ring to which it is attached, selected from halogen, lower alkyl, lower alkenyl, 
aryl, heteroaryl, carbonyl, thiocarbonyl, ketone, aldehyde, amino, acylamino, amido, 
amidino, cyano, nitro, hydroxyl, azido, sulfonyl, sulfoxido, sulfate, sulfonate, sulfamoyl, 
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sulfonamido, phosphoryl, phosphonate, phosphinate, -(CH2)palkyl, -(CH2)palkenyl, - 
(CH2)palkynyl, -(CH2)paryl, -(CH2)paralkyl, -(CH2)pOH, .(CH2)pO-lower alkyl, - 
(CH2)pO-lower alkenyl, .0(CH2)nR, -(CH2)pSH, -(CH2)pS-lower alkyl, -(CH2)pS- 
lower alkenyl, -S(CH2)nR, -(CH2)pN(R)2, -(CH2)pNR-lower alkyl, "(CH2)pNR-lower 
5 alkenyl, -NR(CH2)nR> and protected forms of the above; 

Cy' representsa substituted or unsubstituted aryl, heterocyclyl, heteroaiyl, or 
cycloalkyl, including polycyclic groups; 

j represents, independently for each occiuience, an integer fi:om 0 to 10, 
preferably from 2 to 7; 

10 i represents, independently for each occurrence, an integer fix>m 0 to S, 

preferably from 0 to 2; and 

p and n, individually for each occurrence, represent integers from 0 to 10, 
preferably from 0 to 5. 

15 In certain embodiments, M represents, independently for each occurrence, a 

substituted or unsubstituted methylene group, such as -CHj-, -CHF-, -CHOH-, - 
CH(Me)-, -C(=0)-, etc. 

In certain embodiments, Cy' represents a substituted or imsubstituted bicyclic or 
heterocyclic ring system, preferably both bicyclic and heteroaiyl, such as 

20 benzothiophene, benzofuran, benzopyrrole, benzopyridine, etc. In certain embodiments, 
Cy' is directly attached to X. In certain embodiments, Cy' is a monocyclic aryl or 
heteroaiyl ring substituted at least with a substituted or unsubstituted aryl or heteroaiyl 
ring, e.g., forming a biaiyl system. In certain embodiments, Cy' includes two substituted 
or unsubstituted aryl or heteroaryl rings, e.g., the same or different, directly connected 

25 by one or more bonds, e.g., to form a biaryl or bicyclic ring system. 

In certain embodiments, Y is absent from all positions. In embodiments wherein 

Y is present in a position, i preferably represents an integer from 1-2 in an adjacent Mf if 

i=0 would result in two occurrences of Y being directly attached, or an occurrence of Y 

being directly attached to N or NR2. 

30 In certain embodiments, X is selected from -C(=0>, -C(=S)-, and -S(02)-. 
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Ih certain embodiments, NII2 represents a primary amine or a secondary or 
tertiary amine substituted with one or two lower alkyl groups, aryl groups, or aralkyl 
groups, respectively, preferably a primary amine. 

In certain embodiments, Ri and R2 represent, independently and as valley 
5 permits, from 0-5 substituents on the ring to which it is attached, selected from halogen, 
lower alkyl, lower alkenyl, carbonyl, tbiocarbonyl, ketone, aldehyde, amino, acylamino, 
cyano, nitro, hydroxyl, azido, sulfonyl, sulfoxido, sulfate, stilfonate, sulfamoyl, 
sulfonamido, phosphoryl, phosphonate, phosphinate, -(CH2)palkyl, -(CH2)palkenyl, - 
(CH2)palkynyl, -(CH2)paryl, -(CH2)paralkyl, -(CH2)pOH, -(CH2)pO-lower alkyl, - 
10 (CH2)pO-lower alkenyl, -0(CH2)nR, -(CH2)pSH, -(CH2)pS-lower alkyl, -(CH2)pS- 
lower alkenyl, -S(CH2)nR, -(CH2)pN(R)2, -(CH2)pNR-lower alkyl, -(CH2)pNR-lower 
alkenyl, -NR(CH2)nR» and protected forms of the above, wherein p, individually for 
each occurence, represents an integer from 0 to 10, preferably from 0 to 5. 

15 In certain embodiments, compounds usefril in the present invention may be 

represented by general formula pCVH): 



Formula XVn 

wherein, as valence and stability permit, 

Cy' represents a substituted or unsubstituted aryl or hetCToaiyl ring, including 
20 polycyclics; 




X 



Y 

I 
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Y, independently for each occurrence, may be absent or represent -N(R)-, -0-, - 
S-, or -Ses 

X can be selected from -C(=0>, -C(=S)-, -8(02)-, -S(OK -C(=NCN)-, - 
P(=0)(OR2)-, and a methylene group optionally substituted witii 1-2 groups such as 
s lower alkyl, alkenyl, or alkynyl groups; 

M represents, independently for each occurrence, a substituted or unsubstituted 
methylene group, such as -CH2-, -CHF-, -CHOH-, -CH(Me)-, -C(=0)-, etc., or two M 
taken together represent substituted or xmsubstituted ethene or efhyne; 

R represents, independentiy for each occurrence, H or substituted or 
10 xmsubstituted aryl, heterocyclyl, heteroaryl, aralkyl, heteroaralkyl, alkynyl, alkenyl, or 
alk/1, or two R taken together may form a 4- to 8-membered ring, e.g., with N; 

Cy represents substituted or unsubstituted aryl, heterocyclyl, heteroaryl, or 
cycloalkyl, including polycyclic groups; 

i represents, independentiy for each occurrence, an integer from 0 to 5, 
15 preferably from 0 to 2; and 

n and p, individually for each occurrence, represent integers from 0 to 10, 
preferably from 0 to 5. 

In certain embodiments, M represents, independentiy for each occurrence, a 
20 substituted or vmsubstituted methylene group, such as -CH2-, -CHF-, -CHOH-, - 
CH(Me)-, -C(=0)-, etc. 

In certain embodiments, Cy' represents a substituted or unsubstituted bicyclic or 
hetCToaryl ring system, preferably both bicyclic and heteroaryl, e.g., benzofhiophene, 
benzofuran, benzopyrrole, benzopyridyl, etc. In certain embodiments, Cy' is directiy 
25 attached to X. hi certain embodiments, Cy' is a monocyclic aiyl or heteroaryl ring 
substituted at least with a substituted or xmsubstituted aryl or heteroaryl ring, e.g., 
forming a biaryl system. In certain embodiments, Cy' includes two substituted or 
unsubstituted aiyl or heteroaryl rings, e.g., the same or dijSerent, directiy connected by 
one or more bonds, e.g., to fomi a biaryl or bicyclic ring system. 
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In certain embodiments, Y is absent jQrom all positions. In embodiments wherein 
Y is present in a position, i preferably represents an integer from 1-2 in an adjacent M-, if 
i=0 would result in two occurrences of Y being directly attached, or an occurrence of Y 
being directly attached to N or NR2- 
5 In certain embodiments, X is selected from -C(=0)-, -C(=S)-, and -S(02)-. 

bi certain embodiments, NR2 represents a primary amine or a secondary or 
tertiary amine substituted with one or two lower alkyl groups, aryl gFoiQ>s, or aralkyl 
groups, respectively, preferably a primary amine. 

In certain embodiments, Cy represents a substituted or unsubstituted non- 
10 aromatic carbocyclic or heterocyclic ring, i.e., including at least one sp^ hybridized 
atom, and preferably a plurality of sp*^ hybridized atoms. In certain embodiments, Cy is 
directly attached to N and/or to NR2. In certain embodiments, Cy is a 5- to 7-membered 
ring. In embodim^ts wherein Cy is a six-membered ring directly attached to N and 
bears an amino substituent at the 4 position of the ring relative to N, the N and amine 
15 substituents may be disposed trans on the ring. 

In certain embodiments, Ri and R2 represent, independently and as valency 
permits, from 0-5 substituents on the ring to which it is attached, selected from halogen, 
lower alkyl, lower alkenyl, carbonyl, thiocarbonyl, ketone, aldehyde, amino, acylamino, 
cyano, nitro, hydroxyl, azido, sulfonyl, sulfoxido, sulfate, sulfonate, sulfamoyl, 
20 sulfonamide, phosphoryl, phosphonate, phosphinate, -(CH2)palkyl, -(CH2)paIkenyl, - 
(CH2)palkynyl, -(CH2)paryl, -(CH2)paralkyl, -(CH2)pOH, -(CH2)pO-lower alkyl, - 
(CH2)pO-lower alkenyl, -0(CH2)nR, «(CH2)pSH, -(CH2)pS.lower alkyl, -(CH2)pS- 
lower alkenyl, -S(CH2)nR, -(CH2)pN(R)2, -(CH2)pNR-lower alkyl, .(CH2)[W"lower 
alkenyl, -NR(CH2)nI^ and protected forms of tiie above, v^erein p, individually for 
25 each occurence, represents an integer from 0 to 10, preferably from 0 to 5. 
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In certain embodiments, a subject compound has the structure of Formula XVHI: 




Therein, as valence and stability permit, 

Cy represents a substituted or unsubstituted heterocyclyl or cycloalkyl; 
Cy* is a substituted or unsubstituted aryl or heteroaryl ring; 
W is O or S; 

R represents, independently for each occurrence, H or substituted or 
unsubstituted aryl, heterocyclyl, heteroaryl, aralfcjrl, heteroaralkyl, alkynyl, alkenyl, or 
allqrl, or two R taken together may form a 4- to 8-membered rixig, e.g., with N; 

Ri and R2 represent, independently and as valency permits, from 0-5 substituents 
on the ring to which it is attached, selected from halogen, lower alkyl, lower alkenyl, 
aryl, heteroaiyl, carbonyl, thiocarbonyl, ketone, aldehyde, amino, acylamino, amido, 
amidino, cyano, nitro, hydroxyl, azido, sulfonyl, sulfoxido, sulfate, sulfonate, sul&moyl, 
sulfonamide, phosphoryl, phosphonate, phosphinate, -(CH2)palkyl, -(CH2)palkenyl, - 
(CH2)palkynyl, -(CH2)paryl, -(CH2)paralkyl, -(CH2)pOH, -(CH2)pO-lower alkyl, - 
(CH2)pO-lower alkenyl, -0(CH2)nR, -(CH2)pSH, -(CH2)pS-lower alkyl, -(CH2)pS- 
lower alkenyl, -S(CH2)nR, -(CH2)pN(R)2, -(CH2)pNR-lower alkyl, -(CH2)pNR-lower 
alkenyl, -NR(CH2)nR* and protected forms of the above; 

n and p, individually for each occurrence, represent integers from 0 to 10. 
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In certain embodiments, Cy' represents a substituted or unsubstituted bicyclic or 
heteroaryl ring system, preferably both bicyclic and heteroaryl, e.g., benzothiophene, 
benzofuran, benzopyiiole, benzopyridyl, etc. In certain embodiments, Cy' is directly 
attached to X. 

5 In certain embodiments, NR2 represents a primary amine or a secondary or 

tertiary amine substituted with one or two lower alkyl groups, aryl groups, or aralkyl 
groups, respectively, preferably a primary amine. 

In certain embodiments, Cy represents a substituted or imsubstituted saturated 
carbocyclic or heterocyclic ring, i,e., composed of a plurality of sp hybridized atoms. In 

10 certain embodiments, Cy is a 5- to 7-membered ring. In embodiments wherein Cy is a 
six-membered ring directly attached to N and bears an amino substituent at the 4 
position of the ring relative to N, the N and amine substituents may be disposed trans on 
the ring. 

In certain embodhnents, Ri and R2 represent, independently and as valency 
15 permits, from 0-5 substituents on the ring to which it is attached, selected from halogen, 
lower alkyl, lower alkenyl, carbonyl, thiocarbonyl, ketone, aldehyde, amino, acylamino, 
cyano, nitro, hydroxyl, sulfonyl, sulfoxide, sulfate, sulfonate, sulfamoyl, sulfonamido, - 
(CH2)palkyl, -(CH2)palkenyl, .(CH2)palkynyl, -(CH2)paryl, -(CH2)paralkyl, -(CH2)pOH, 
-(CH2)p04ower alkyl, -(CH2)pO-lower alkenyl, -0(CH2)nR, -(CH2)pSH, -(CH2)pS- 
20 lower alkyl, -(CH2)pS-lower alkenyl, -S(CH2)nR. -(CH2)pN(R)2, -(CH2)pNR-lower 
alkyl, -(CH2)pNR-lower alkenyl, -NR(CH2)nR, and protected forms of the above. 

In certain embodiments, a subject compound has a structure of Formula XIX: 
w 




wherein, as valence and stability permit. 
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U represents a substituted or unsubstituted aryl or heteroaryl ring fiised to tiie 
nitrogen-containing ring; 

V represents a lower alkylene group, such as methylene, 1,2-ethylene, 1,1- 
efhylene, 1,1 -propylene, 1,2-propylene, l,3-propylene,etc.; 
5 W represents S or O, preferably O; 

X represents C==0, C=S, or SCh; 

R3 represents substituted or unsubstituted aryl, heteroaryl, lower alkyl, lower 
alkenyl, lower alkynyl, carbocyclyl, carbocyclylalkyl, heterocyclyl, heterocyclylalkyl, 
aralkyl, or heteroaralkyl; 
10 R4 represents substituted or unsubstituted aralkyl or lower alkyl, such as 

phenethyl, ben2yl, or aminoalkyl, etc.; 

R5 represents substituted or unsubstituted aryl, heteroaryl, aralkyl, or 
heteroaralkyl, including polycyclic aromatic or heteroaromatic groups. 

15 In certain embodiments, U represents a phenyl ring fused to the nitrogen- 

containing ring. 

In certain embodiments, R3 is selected from substituted or unsubstituted aryl, 
heteroaiyl, lower alkyl, lower aflcenyl, aralkyl, and heteroaralkyl. 

In certain embodiments, R4 is an unsubstituted lower alkyl group, or is a lower 
20 alkyl group substituted with a secondary or tertiary amine. 

In certain embodiments, Rs is selected from substituted or unsubstituted phenyl 
or naphthyl, or is a diaiylalkyl group, such as 2,2-diphenylethyl, diphenylmethyl, etc. 

Moreover, the subject methods can be performed on cells which are provided in 
culture (m vitro\ or on cells in a whole animal (in vivo). See, for example, PCT 
25 publications WO 95/18856 and WO 96/17924 (the specifications of which are expressly 
incorporated by reference herein). 

VI. Testing for Biolopical Activity 

While many bioassays have been used to demonstrate hedgehog activity, the 
30 C3H10T1/2 cell line provides a simple system for assessing hedgehog function without 
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the complication of having to work with primary cell cultures or organ explants. The 
mouse embryonic fibroblast line C3H10T1/2 is a mesenchymal stem cell line that, under 
defined conditions, can differentiate into adipocytes, chondrocytes, and bone osteoblasts 
(Taylor, S.M., and Jones, P.A., Cell 17: 771-779 (1979) and Wang, E.A,, et al,, Growth 

5 Factors 9: 57-71 (1993)). Bone morphogenic proteins drive the differentiation of C3H 
lOT 1/2 cells into the bone cell lineage and alkaline phosphatase induction has been 
used as a marker for this process (Wang et al., supra). Shh has a similar effect on 
C3H10T1/2 cells (Kinto, N. et al., FEBS Letts. 404: 319-323 (1997)) and we routinely 
use the alkaline phosphatase induction by Shh as a quantitative measure of its in vitro 

10 potency. Shh treatment also produces a dose-dependent increase in gli-1 and ptc-1 
expression, which can be readily detected by a PCR-based analysis. 

We found that hedgehog protein can upregulate fibroblast e^qpression of 
angiogenic growth factors, includmg VEGF121, VEGF165, VEGF189, Ang-1, and 
Ang-2 (Example 4). Thus, the procedure outlined in Example 4 provides a new method 

15 of measuring the in vitro angiogenic potential of hedgehog. Without wishing to be 
bound by any particular theory, this irpregulation may e3q>lain the mechanism whereby 
hedgehog exerts its angiogenic effect. 

Similarly, this cell line provides a simple bioassay to test the agonistic or 
antagonistic properties of the hedgehog therapeutics of the present invention. In 

20 preferred embodiments, agonists would be expected to induce alkaline phosphatase in 
CSHlOTl/2 cells. In other embodiments, antagonists would be expected to inhibit the 
induction of alkaline phosphatase by exogenous hedgehog. 

Further, persons having ordinary skill in the art will recognize means for 
determining if the hedgehog agents used in the present methods are efficacious in vivo. 

25 For instance, clinicians have available to them a variety of non-invasive tests such as 
echograms, electrocardiograms, CAT scans, MRI to determine vascular and cardiac 
functioning. Other methods include angiography and other more invasive physiological 
testing methods. For patients with neuropathies, nerve conduction velocity tests may be 
routinely performed. To test for the anti-angiogemc Amotion of hedgehog antagonists, 

30 persons of ordinary skill in the art way use a variety of imaging methods such as CAT 
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and MRI scans, as well as more invasive tests to look at blood chemistry and tumor 
metabolism. 

Vn, Subjects for Treatment 
5 As a general matter, the methods of the present invention may be utilized for any 

mammalian subject needing modulation of angiogenesis. Mammalian subjects which 
may be treated according to the methods of the invention include, but are not limited to, 
human subjects or patients. In addition, however, the invention may be employed in the 
treatment of domesticated mammals which are maintained as human companions (e.g., 
10 dogs, cats, horses), \sdiiich have significant commercial value (e.g., dairy cows, beef 
cattle, sporting animals), which have significant scientific value (e»g., captive or firee 
specimens of endangered species), or which otherwise have value. In addition, as a 
general matter, the subjects for treatment with the methods of the present invention need 
not present indications for treatment with the agents of the invention other than those 
15 indications associated with need for modulation of angiogenesis. That is, the subjects for 
treatment are expected to be otherwise fiee of indications for treatment with the 
hedgehog therapeutic agents of the invention. 

One of ordinary skill in the medical or veterinary arts is trained to recognize 
subjects which may need modulation of angiogenesis . Jn particular, clinical and non- 
20 clinical trials, as well as accumulated experience, relating to the presently disclosed and 
other methods of treatment, are expected to inform the skilled practitioner in deciding 
whether a given subject is in need of modulation and v^ether any particular treatment is 
best suited to the subject's needs, including treatment according to the present invention. 

25 VnL Utilities, Formulations and Methods of Treatment 
A. General 

We show that hedgehog receptor (ptcl) is normally expressed in the vasculature. 
We used a mouse which carries the lacZ reporter gene under the control of the 
endogenous ptc 1 promoter to determine the expression of ptcl in normal adult animal ja 
30 (Example 1). We further determined that mice injected with hedgehog protein for 3 days 
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showed no obvious physical or behavioral differences compared to vehicle-treated or 
untreated littermates. The vascular and cardiovascular staining pattern for ptcl seen in 
normal animals intensifies significantly in animals injected with increasing doses of 
hedgehog protein. Our data show that systemic administration of hedgehog can induce 
s ptcl upregulation and indicate that these vascular tissues are responsive to hedgehog 
protein. 

We further determined that hedgehog induces neovascularization in a comeal 
model of angiogenesis (Example 3) as well as a matrigel plug model of angiogenesis 
(Example 2) . We further found that there was a striking qualitative difference in the 

10 appearance of vessels induced by hedgehog compared to VEGF. VEGF induced a fine 
mesh of capillaries which are short tortous sprouts firom the extended branches of the 
preexisting limbus vessels at the base of the eye. In contrast, hedgehog induced much 
larger vessels which extended all the way to the pellet and contained numerous 
anastamoses between the venous and arterial circulation 

IS Moreover, we employed surgical ligation of the femoral artery and removal of a 

segment of the artery distal to the ligation in mice to induce limb ischemia (Example S). 
We foimd that hedgehog improves recovery fi-om such ischemic limb injury. 

In yet another clinically relevant animal model, we placed an ameroid constrictor 
around the left circumflex coronary artery of pigs. We determined that hedgehog protein 

20 or gene therapy can also improve these measures of cardiac perfusion, viability and 
function following ischemia in this model (Example 6). We determined that hedgehog 
protein is overexpressed in several human gastrointestinal tumor cell lines compared to 
normal human gastrointestinal epithelial cells or fibroblasts (Example 7) and that 
inhibition of hedgehog using, for example, anti-hedgehog blocking antibody, may 

25 decrease tumor growth rate and/or tumor angiogenesis (Example 7). 

Accordingly, the methods of this invention may employ hedgehog therapeutics 
or biologically active portions thereof, to promote angiogenesis, such as, to repair 
damage of myocardial tissue as a result of myocardial infarction. Such methods may 
also include the repair of the cardiac vascular system after ischemia including the 

30 growth of collateral vasculature. Methods utilizing hedgehog therapeutics may be 
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employed to stimulate the growth of transplanted tissue and collateral vasculature where 
coronary bypass surgery is performed. Methods may also treat damaged vascular tissue 
as a result of coronary artery disease and peripheral or central nervous system vascular 
disease or ischemi£L 

5 Methods of the invention may also promote wound healing, particularly to re- 

vascularize damaged tissues or stimulate collateral blood flow during ischemia and 
where new capillary angiogenesis is desired. Other methods of flie invention may be 
employed to treat full-thickness woimds such as dermal ulcers, including pressure sores, 
venous ulcers, and diabetic ulcers. In addition, methods employing hedgehog 

10 therapeutics may be employed to treat full-thickness bums and injuries where a skin 
graft or flap is used to repair such bums and injuries. Such hedgehog therapeutics may 
also be employed for use in plastic surgery, for example, for the repair of lacerations, 
bums, or other trauma. In urology, methods of the invention may assist in recovery of 
erectile function. In the field of female reproductive health, methods of the inv^tition 

13 may assist in the modulation of menstruation, ovulation, endometrial lining formation 
and maintanence, and placentation. 

Since angiogenesis is important in keeping wounds clean and non-infected, 
methods may be employed in association with surgery and following the repair of cuts. 
They may also be employed for the treatment of abdominal wounds where there is a 

20 high risk of infectioiL Methods using hedgehog therapeutics described herein may be 
employed for the promotion of endotheliaUzation in vascular graft surgery. In the case 
of vascular grafts using either transplanted or synthetic material, hedgehog therapeutics 
can be applied to the surface of the graft or at the junction to promote the growth of 
vascular smooth muscle and adventitial cells in conjunction with endothelial cells. 

25 Methods of the invention may also be employed to coat artificial prostheses or 

natural organs which are to be transplanted in the body to minimize rejection of the 
transplanted material and to stimulate vascularization of the transplanted materials and 
may also be employed for vascular tissue repair, for example, that occurring during 
arteriosclerosis and required following balloon ai^oplasty ^ere vascular tissues are 
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damaged. Specifically, methods of the invention may be employed to promote recovery 
from arterial wall injury and thereby inhibit restenosis. 

Nucleic acid sequences encoding hedgehog therapeutics may also be employed 
for in vitro purposes related to scientific research, synthesis of DNA and manufacture of 

5 DNA vectors, and for the production of diagnostics and therapeutics to treat human 
disease. For example, methods of the invention may involve in vitro culturing of 
vascular smooth muscle cells, fibroblasts, hematopoietic cells, muscle, myotendonous 
jimction, bone or cartilage- derived cells and other mesenchymal cells, where a 
hedgehog therapeutic is added to the conditional medium in a concentration firom 10 

10 ng/ml to 20 ug/ml. 

Antagonistic hedgehog therapeutics may be employed to limit angiogenesis 
necessary for solid tumor metastasis. The identification of antagonists can be used for 
the generation of certain inhibitors of vascular endothelial growth factor. Since 
angiogenesis and neovascularization are essential steps in solid tumor growth, inhibition 

IS of angiogenic activity of the vascular endothelial growth factor is very useful to prevent 
the further growth, retard, or even regress solid tumors. Gastrointestinal tumors and 
gliomas are also a type of neoplasia which may be treated with the antagonists of the 
present invention. 

In addition to these disorders, the antagonists may also be employed to treat 
20 retinopathy associated with diabetes, rheumatoid arthritis, osteoarthritis, macular 
degeneration, glaucoma. Keloid formation, ulcerative colitis, Krohn's disease, psoriasis, 
and oth^ conditions caused are exacerbated by inoreased angiogenic activity. The 
antagonists may be employed in a composition with a pharmaceutically acceptable 
carrier, e.g., as described herein. 
25 These therapeutic agents may be administered by any route which is compatible 

with the particular agent employed. The hedgehog therapeutic agents of the invention 
may be provided to an individual by any suitable means, preferably directly (e.g., 
locally, as by injection or topical administration to a tissue locus) or systemically (e.g., 
parenterally or orally). Where the agent is to be provided parenterally, such as by 
30 intravenous, intraarterial, subcutaneous, or intramuscular, administration, the agent 
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preferably comprises part of an aqueoxis solution. The solution is physiologically 
acceptable so that in addition to delivery of the desired agent to the subject, the solution 
does not otherwise adversely affect the subject's electrolyte and/or volume balance* The 
aqueous medium for the hedgehog Aerapeutic may comprise normal physiologic saline 

5 (e.g., 9.85% NaCl, 0. 1 5M, pH 7-7.4). 

The hedgehog ther^eutics are preferably administered as a sterile 
pharmaceutical composition containing a pharmaceutically acceptable carrier, which 
may be any of the numerous well known carriers, such as water, saline, phosphate 
buffered saline, dextrose, glycerol, ethanol, and the like, or combinations thereof The 

10 compounds of the present invention may be used in the form of pharmaceutically 
acceptable salts derived from inorganic or organic acids and bases. Included among such 
acid salts are the following: acetate, adipate, alginate, aspartate, benzoate, 
b^[izmesulfonate, bisulfate, butyrate, citrate, camphorate, camphorsulfonate, 
cyclopentanepropionate, digluconate, dodecylsulfate, ethanesulfonate, fumarate, 

15 glucoheptanoate, glycerophosphate, hemisulfate, heptanoate, hexanoate, hydrochloride, 
hydrobromide, hydroiodide, 2-hydroxyethanesulfonate, lactate, maleate, 
methanesulfonate, 2-naphthalenesuIfonate, nicotinate, oxalate, pamoate, pectinate, 
persulfate, 3-phenyl-propionate, picrate, pivalate, propionate, succinate, tartrate, 
thiocyanate, tosylate and undecanoate. Base salts include ammonium salts, alkali metal 

20 salts, such as sodium and potassium salts, alkaline earth metal salts, such as calcium and 
magnesium salts, salts with organic bases, such as dicyclohexylamine salts, N-methyl- 
D-glucamine, trisOiydroxymethyl)metfaylamine and salts with amino acids such as 
arginine, lysine, and so fordi. Also, the basic nitrogen-containing groups can be 
quatemized with such agents as lower alkyl halides, such as methyl, ethyl, propyl, and 

25 butyl chloride, bromides and iodides; dialkyl sul&tes, such as dimetiiyl, diethyl, dibutyl 
and diamyl sulfates, long chain halides such as decyl, lauryl, myristyl and stearyl 
chlorides, bromides and iodides, aralkyl halides, such as benzyl and phenethyl bromides 
and others. Water or oil-soluble or dispersible products are thereby obtained. 

Phamiaceutical conq)ositions of hedgehog therapeutics comprise any of the 

30 compoxmds of the present invention, or pharmaceutically acceptable derivatives thereof, 
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together with any phannaceiitically acceptable carrier. The term "carrier" as used herein 
includes acceptable adjuvants and vehicles. Phannaceutically acceptable carriers that 
may be used in the pharmaceutical compositions of this invention include, but are not 
limited to, ion exchangers, aluxnina, alimiinum stearate, lecithin, serum proteins, such as 

5 human serum albumin, buffer substances such as phosphates, glycine, sorbic acid, 
potassium sorbate, partial glyceride mixtures of saturated vegetable fatty acids, water, 
salts or electrolytes, such as protamine sul&te, disodium hydrogen phosphate, potassium 
hydrogen phosphate, sodiiun chloride, zinc salts, colloidal silica, magnesium trisilicate, 
polyviayl pyrrolidone, cellulose-based substances, polyethylene glycol, sodium 

10 carboxymethylcellulose, polyacrylates, waxes, polyethylene-polyoxypropylene-block 
polymers, polyethylene glycol and wool fat 

According to this invention, the pharmaceutical compositions may be in the form 
of a sterile injectable preparation, for example a sterile injectable aqueous or oleaginous 
suspension. This suspension may be formulated according to techniques known in the 

15 art using suitable dispersing or wetting agents and suspending agents. The sterile 
injectable preparation may also be a sterile injectable solution or suspension in a non- 
toxic parenterally-acceptable diluent or solvent, for example as a solution in 1,3- 
butanediol. Among the acceptable vehicles and solvents that may be employed are 
water. Ringer's solution and isotonic sodiimi chloride solution. In addition, sterile, fixed 

20 oils are conventionally employed as a solvent or suspending medium. For this purpose, 
any bland fixed oil may be employed including synthetic mono- or di-glycerides. Fatty 
acids, such as oleic acid and its glyceride derivatives are usefid in the preparation of 
injectables, as do natural pharmaceutically-acceptable oils, such as olive oil or castor oil, 
especially in their polyoxyethylated versions. These oil solutions or suspensions may 

25 also contain a long-chain alcohol diluent or dispersant. 

Controlled release administration of a particular hedgehog therapeutic may be 
usefiil. For example, the therapeutic may be administered using intravenous infusion, an 
implantable osmotic pump, a transdermal patch, liposomes, or otiier mod^ of 
administration. In one embodiment, a pump may be used [Langer et al., eds.. Medical 

30 Applications of Controlled Release, CRC Pres., Boca Raton, Fla. (1974); Sefton, CRC 
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Crit Ref. Biomed. Eng., 14:201 (1987); Buchwald et al.. Surgery, 88:507 (1980); 
Saudek et al., N. Engl. J. Med., 321:574 (1989)]. In another embodiment, polymeric 
materials can be \ised [see, Langer, 1974, supra; Sefton, 1987, supra; Smolen et al,, eds.. 
Controlled Drug Bioavailability, Drug Product Design and Performance, Wiley, N.Y. 

5 (1984); Ranger et al., J. Macromol. Sd. Rev. Macromol. Chem., 23:61 (1983); see also 
Levy et al.. Science, 228:190 (1985); During et al., Ann. Neurol., 25:351 (1989); 
Howard et al., J. Neurosurg., 71:105 (1989)]. In yet another embodiment, a controlled 
release system can be placed in proximity of the therapeutic target, e.g., a tumor, thus 
requiring only a fraction of the systemic dose [see. e.g., Goodson, in Medical 

10 Applications of Controlled Release, vol. 2, pp. 115-138 (1984)]. Other controlled 
release systems are discussed in the review by Langer, Science, 249:1527-1533 (1990). 
In another embodiment, the therapeutic compoimd can be delivered in a vesicle, in 
particular a liposome (see Langer, 1990, supra); Treat et al., in Liposomes in the 
Therapy of Infectious Disease and Cancer, Lopez-Berestein and Fidler (eds.), Liss, New 

15 York, pp. 353-365 (1989); Lopez-Berestein, pp. 317-327; see generally id.). 

B. Oral Delivery 

Contemplated for use herein are oral solid dosage forms, which are described 
generally in Martin, Chapter 89, 1990, supra, which is herein incorporated by reference. 

20 Solid dosage forms include tablets, capsules, pills, troches or lozenges, cachets or 
pellets. Also, liposomal or proteinoid encapsulation may be used to formulate the 
present compositions (as, for example, proteinoid microspheres reported in U.S. Pat. No. 
4,925,673). Liposomal encapsulation may be used and the liposomes may be derivatized 
with various polymers (e.g., U.S. Pat. No. 5,013,556). A description of possible solid 

25 dosage forms for the therapeutic is given by Marshall, in Modem Pharmaceutics, 
Ch£^ter 10, Banker and Rhodes ed., (1979), herein incorporated by reference. In 
general, the formulation will include the therapeutic (or chemically modified form), and 
inert ingredients which allow for protection against the stomach environment, and 
release of the biologically active material in the intestine. 
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For the protein (or derivative) the location of release may be the stomach, the 
small intestine (the duodenum, the jejunem, or the ileum), or the large intestine. One 
skilled in the art has available formulations which will not dissolve in the stomach, yet 
will release the material in the duodenum or elsewhere in the intestine. Preferably, the 

5 release will avoid the deleterious effects of the stomach environment, either by 
protection of the protein (or derivative) or by release of the biologically active material 
beyond the stomach environment, such as in the intestine. To ensure full gastric 
resistance, a coating impermeable to at least pH. 5.0 is essential. Examples of the more 
common inert ingredients that are used as enteric coatings are cellulose acetate 

10 trimellitate (CAT), hydroxypropyhnethylceUulose phthalate (HPMCP), HPMCP 50, 
HPMCP 55, polyvmyl acetate phthalate (PVAP), Eudragit L30D, Aquateric, cellulose 
acetate phthalate (CAP), Eudragit L, Eudragit S, and Shellac. These coatings may be 
used as mixed films. A coating or mixture of coatings can also be used on tablets, which 
are not intended for protection against the stomach. This can include sugar coatings, or 

15 coatings which make the tablet easier to swallow. Capsules may consist of a hard shell 
(such as gelatin) for delivery of dry therapeutic i.e. powder; for liquid forms, a soft 
gelatin shell may be used. The shell material of cachets could be thick starch or other 
edible paper. For pills, lozenges, molded tablets or tablet triturates, moist massing 
techniques can be used. 

20 The therapeutic can be included in the formulation as fine multiparticvdates in 

the form of granules or pellets of particle size about 1 mm. The formulation of the 
material for c^sule administration could also be as a powder, lightly compressed plugs 
or even as tablets. The therapeutic could be prepared by compression. Colorants and 
flavoring agents may all be included. For example, the protein (or derivative) may be 

25 formulated (such as by liposome or microsphere encapsulation) and tiien fiirther 
contained within an edible product, such as a refrigemted beverage containing colorants 
and flavoring agents. One may dilute or increase the volume of the therapeutic with an 
inert material. Hxese diluents could include carbohydrates, especially mannitol, alpha - 
lactose, anhydrous lactose, cellulose, sucrose, modified dextrans and starch. Certain 

30 inorganic salts may be also be used as fillers including calciiun triphosphate, magnesium 
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carbonate and sodium chloride. Some commercially available diluents are Fast-Flo, 
Emdex, STA-Rx 1500, Emcompress and AvicelL Disintegrants may be included in the 
formulation of the therapeutic into a solid dosage form. Materials used as disintegrants 
mclude but are not limited to starch including the commercial disintegrant based on 

5 starch, Explotab. Sodium starch glycolate, Amberlite, sodium carboxymethylcellulose, 
ultramylopectin, sodiiun alginate, gelatin, orange peel, acid carboxymethyl cellulose, 
natural sponge and bentonite may all be used. Another form of the disintegrants are the 
insoluble cationic exchange resins. Powdered gums may be used as disintegrants and as 
binders and these can include powdered gums such as agar, Karaya or tragacanth. 

10 Alginic acid and its sodium salt are also useful as disintegrants. Binders may be used to 
hold the therapeutic agent together to form a hard tablet and include materials from 
natural products such as acacia, tragacanth, starch and gelatin. Others include methyl 
cellulose (MC), etiiyl cellidose (EC) and carboxymethyl cellulose (CMC). Polyvinyl 
pyrrolidone (PVP) and hydroxypropylmethyl cellulose (HPMC) could both be used in 

15 alcoholic solutions to granulate the therapeutic. An antifrictional agent may be included 
in the formulation of the therapeutic to prevent sticking during the formulation process. 
Lubricants may be used as a layer between the therapeutic and the die wall, and these 
can include but are not limited to: stearic acid including its magnesium and calcium 
salts, polytetrafluoroethylene (PTFB), liquid parafBn, vegetable oils and waxes. Soluble 

20 lubricants may also be used such as sodium lauryl sulfate, magnesium lauryl siilfate, 
polyethylene glycol of various molecular weights, and Carbowax 4000 and 6000. 
Glidants that might improve the flow properties of the drug during formulation and to 
aid rearrangement during compression might be added. The glidants may include starch, 
talc, pyrogenic silica and hydrated silicoaluminate. 

25 To aid dissolution of the ther^eudc into the aqueous environment, a sur&ctant 

might be added as a wetting agent Surfactants may include anionic detergents such as 
sodium lauryl sulfete, dioctyl sodium sulfosuccinate and dioctyl sodium sulfonate. 
Cationic detergents might be used and could include benzalkonium chloride or 
benzethomium chloride. The list of potential nonionic detergents diat could be included 

30 in the formulation as suifectants are lauromacrogol 400, polyoxyl 40 stearate, 
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polyoxyethylene hydrogenated castor oil 10, 50 and 60, glycerol monostearate, 
polysorbate 40, 60, 65 and 80, sucrose fatty acid ester, methyl cellulose and 
carboxymethyl cellulose. These surfactants could be present in the formulation of the 
protein or derivative either alone or as a mi>cture in different ratios. Additives which 
5 potentially enhance uptake of the protein (or derivative) are for instance the fatty acids 
oleic acid, linoleic acid and linolenic acid. 

C. Pulmonary Delivery 

Also contemplated hereia is pulmonary delivery of the preset protems (or 

10 derivatives tiiereof). The protein (or derivative) is delivered to the lungs of a mammal 
while mhaling and traverses across the lung epithelial lining to the blood-stream. Other 
reports of this include Adjei et al.. Pharmaceutical Research, 7(6):565-569 (1990); Adjei 
et al.. International Journal of Pharmaceutics, 63:135-144 (1990) (leuprolide acetate); 
Braquet et al., Joumal of Cardiovascular Pharmacology, 13(suppl. 5):143-146 (1989) 

15 (endothelia-1); Hubbard et al.. Annals of Memal Medicine, 3(3):206-212 (1989) ( alpha 
1-antitrypsin); Smith et al., J. Clm. Invest., 84:1145-1146 (1989) ( alpha 1 -proteinase); 
Os wein et al., "Aerosolization of Proteias", Proceedings of Symposium on Respiratory 
Drug Delivery H, Keystone, Colo., O^arch 1990) (recombioant human growth 
hormone); Debs et al., J. Immunol., 140:3482-3488 (1988) (interferon- gamma and 

20 tumor necrosis factor alpha) and Platz et al., U.S. Pat No. 5,284,656 (granulocyte 
colony stimulating £actor). Contemplated for use in the practice of this invention are a 
wide range of mechanical devices designed for pulmonary delivery of ther^utic 
products, including but not limited to nebulizers, metered-dose inhalers, and powder 
inhalers, all of which are familiar to those skilled in the art. 

23 Some specific examples of commercially available devices suitable for the 

practice of this invention are the Ultravent nebulizer, manufactured by Mallinckrodt, 
Inc., St Louis, Mo.; the Acorn n nebulizer, manufactured by Marquest Medical 
Products, Englewood, Colo.; tibe Ventolin metered-dose inhaler, manufactured by Glaxo 
Inc., Research Triangle Park, N.C.; and the Spinhaler powder inhaler, maniifactured by 

30 Fisons Corp., Bedford, Mass. All such devices reqxiire the use of formulations suitable 
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for the dispensing of protein (or derivative). Typically, each formulation is specific to 
the type of device employed and may involve Hie use of an appropriate propellant 
material, in addition to the usual diluents, adjuvants and/or carriers useful in therapy. 
Also, the use of liposomes, microcapsules or microspheres, inclusion complexes, or 

5 other types of carriers is contemplated. Chemically modified proteia may also be 
prepared in different formulations depending on the type of chemical modification or 
the type of device employed. 

Formulations smtable for use with a nebulizer, either jet or ultrasonic, will 
typically comprise protein (or derivative) dissolved in water at a concentration of about 

10 0.1 to 25 mg of biologically active protein per ml of solution. The formulation may also 
include a buffer and a simple sugar (e.g., for protein stabilization and regulation of 
osmotic pressure). The nebulizer formulation may also contain a surfactant, to reduce or 
prevent surface induced aggregation of the protein caused by atomization of the solution 
in forming the aerosol. 

15 Formulations for use with a metered-dose inhaler device will generally comprise 

a finely divided powder containing the protein (or derivative) suspended in a propellant 
with the aid of a surfactant. The propellant may be any conventional material employed 
for this purpose, such as a chlorofluorocarbon, a hydrochlorofluorocarbon, a 
hydrofluorocarbon, or a hydrocarbon, including trichlorofluoromethane, 

20 dichlorodifluoromethane, dichloiotetrafluoroethanol, and 1,1,1,2-tetrafluoroethane, or 
combinations thereof. Suitable surfactants include sorbitan trioleate and soya lecithin. 
Oleic acid may also be useful as a surfactant. 

Formulations for dispensing fcom a powder inhaler device will comprise a finely 
divided dry powder containiag protein (or derivative) and may also include a bulking 

25 agent, such as lactose, sorbitol, sucrose, or mannitol in amounts which facilitate 
dispersal of the powder from the device, e.g., 50 to 90% by weight of the formulation. 
The protein (or derivative) should most advantageously be prepared in particulate form 
witii an average particle size of less than 10 mum (or microns), most preferably 0.5 to 5 
mum, for most effective delivery to the distal lung. 

30 
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D. Dosages 

For all of the above molecules, as further studies are conducted, information will 
emerge regarding appropriate dosage levels for treatment of various conditions in 
various patients, and the ordinary skilled worker, considering the therapeutic context, 

5 age and general health of the recipient, will be able to ascertain the proper dosage. 
Generally, for injection or infusion, dosage will be between 0.01 mu. g of biologically 
active protein/kg body weight, (calculating the mass of tiie protein alone, without 
chemical modification), and 10 mg/kg (based on the same). The dosing schedule may 
vary, depending on the circulation half-life of the protein or derivative used, whether the 

10 polypeptide is delivered by bolus dose or continuous infusion, and the formulation used. 

E. Administration with Other Compounds 

For therapy associated with modulating angiogenesis, one may administer the 
present hedgehog therapeutics (or derivatives) in conjunction with one or more 

15 pharmaceutical compositions used for treating other clinical compUcations of the need 
for angiogenic modulation, such as those used for treatment of cancer (e.g., 
chemotherapeutics), cachexia, high blood pressure, high cholesterol, and other adverse 
conditions. Administration may be simultaneous or may be in seriatim. Similarly, one 
may administer more than one hedgehog thempeutic (or derivatives), having the same or 

20 differing mode of action, to attain an additive or synergistic effect on angiogenesis. 

F. Nucleic Acid-Based Therapeutic Treatment 

Nucleic acid sequences encoding an antagonisitic hedgehog therapeutic could be 
introduced into human tumor or blood vessel cells to develop gene therapy. Similarly, 
25 nucleic acid sequences encoding an agonistic hedgehog therapeutic could be introduced 
into human cells as a gene therapy based treatment. 

In one embodiment, a nucleic acid sequence encoding a hedgehog therapeutic is 
introduced in vivo in a viral vector. Such vectors include an attenuated or defective 
DNA virus, such as but not limited to herpes simplex virus (HSV), p^illomavirus, 
30 Epstein Barr virus (EBV), adenovirus, adeno-associated virus (AAV), and the like. 
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Defective viruses, which entirely or ahnost entirely lack viral geries, aie preferred ' 
Defective virus is not infective after introduction into a cell. Use of defective viral 
vectors allows for administration to cells in a specific, localized area, without concern 
that the vector can infect other cells. Thus, adipose tissue can be specifically targeted. 

5 Examples of particxilar vectors include, but are not limited to, a defective herpes virus 1 
(HS V 1) vector [Kaplitt et al., Molec. Cell. Neurosci., 2:320-330 (1991)], an attenuated 
adenovirus vector, such as the vector described by Stratford-Perricaudet et al., J. Clin. 
Invest, 90:626-630 (1992), and a defective adeno-associted virus vector [Samulski et 
al., J. ViroL, 61:3096-3101 (1987); Samulski et al., J. ViroL, 63:3822-3828 (1989)]. In 

10 another embodiment, the nucleic acid can be introduced in a retroviral vector, e.g., as 
described in Anderson et al., U.S. Pat No, 5,399,346; Mann et al.. Cell, 33:153 (1983); 
Temin et al., U.S. Pat No. 4,650,764; Temin et al., U.S. Pat No. 4,980,289; Markowitz 
et al., J. ViroL, 62:1120 (1988); Temin et al,, U.S. Pat No. 5,124,263; International 
Patent Publication No. WO 95/07358, published Mar. 16, 1995, by Dougherty et al.; and 

15 Kuo et al.. Blood, 82:845 (1993). 

Alternatively, the vector can be introduced in vivo by lipofection. For the past 
decade, there has been increasing iise of liposomes for encapsulation and transfection of 
nucleic acids in vitro. Synthetic cationic lipids designed to limit the difficulties and 
dangers encountered with liposome mediated transfection can be used to prepare 

20 liposomes for in vivo transfection of a gene encoding a marker [Feigner et al., Proc. 
Natl. Acad. Sci. USA, 84:7413-7417 (1987); see Mackey et al., Proc. Natl. Acad. Sci. 
USA, 85:8027-8031 (1988)]. The use of cationic lipids may promote enc^sulation of 
negatively charged nucleic acids, and also promote fusion with negatively charged cell 
membranes [Feigner et al.. Science, 337:387-388 (1989)]. The use of lipofection to 

25 introduce exogenous genes into specific organs in vivo has certain practical advantages. 
Molecular targeting of liposomes to specific cells represents one area of benefit It is 
clear that directing transfection to particxilar cell types would be particularly 
advantageous in a tissue with cellular heterogeneity, such as the pancreas, Uver, kidney, 
and braiiL Lipids may be chemically coupled to other molecules for the purpose of 

30 targeting (see Mackey et al., 1988, supra). Targeted peptides, e.g., hormones or 
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neurotransmitters, and proteins such as antibodies, or non-peptide molecules could be 

coupled to liposomes chemically. 

It is also possible to introduce the vector in vivo as a naked DNA plasmid. 

Naked DNA vectors for gene therapy can be introduced into the desired host cells by 
5 methods known in the art, e.g., transfection, electroporation, microinjection, 

transduction, cell fusion, DEAE dextran, calcium phosphate precipitation, use of a gene 

gim, or use of a DNA vector transporter (see, e.g., Wu et al., J. BioL Chem., 267:963- 

967 (1992); Wu et al., J. BioL Chem., 263:14621-14624 (1988); Hartmut et al., 

Canadian Patent Application No. 2,012,311, filed Mar. 15, 1990). 
10 It is also possible to introduce the vector in vivo in conjuction with a catheter or 

other device. See Vale et al., 1999: Komowski et al., 2000. 

H. Diagnostics 

A diagnostic method usefiil in the present invention comprises examining a 
IS cellular sample or medimn by means of an assay including an effective amount of an 
antagonist to a hedgehog protein, such as an anti-hedgehog antibody homolog, 
preferably an affinity-purified polyclonal antibody, and more preferably a mAb. In 
addition, it is preferable for the anti-hedgehog antibody molecules used herein be in the 
form of Fab, Fab', F(ab)2 or F(v) portions or whole antibody molecules. As previously 
20 discussed, patients capable of benefiting from this method include those suffering firom 
cancer or other conditions where abnormal angiogenesis is a characteristic or factor. 
Methods for isolating hedgehog protein and inducing anti-hedgehog antibodies and for 
determining and optimizing the ability of anti-hedgehog antibodies to assist in the 
examination of the target cells are all well-known in the art 

25 

The present invention will be illustrated by the following, non-limiting 
examples. These are described in further detail in the pending publication, Pola et al., 
2001, Nature Medicine, incorporated herein. 

30 Example 1. Hedgehog Responsive Cells in Normal Vasculature 
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The Egression of Hedgehog Receptor in Normal Vasculature 

The hedgehog receptor which is coupled directly to the hedgehog signalling 
pathway is patched 1 (ptcl). In addition to being the primary hedgehog receptor in the 
signalling pathway, ptcl gene expression is also induced by signalling through the 

5 hedgehog pathway. The expression of the ptcl gene in cells can thus indicate that the 
cell is potentially responsive to hedgehog proteins and can also show that the cell is in 
the process of responding to hedgehog stimulation. We used a mouse which carries the 
lacZ reporter gene under the control of the endogenous ptcl promotor to determine the 
expression of ptcl in normal adult animals 

10 Ptcl-lacZ mice carry a non disruptive insertion of the lacZ reporter gene 

containing a nuclear localization signal upstream of the ptcl coding region. LacZ 
expression corresponds to ptcl expression (Goodrich et al., 1997; M. Scott, Ontogeny, 
personal communication). Ptcl e:?q>ression does not appear to be altered by LacZ 
insertion and expression corresponds to ptcl expression in embryos (M. Scott, 

15 Ontogeny, personal communication). Heterozygous Ptcl-lacZ mice and their wild ^pe 
littermate controls are generated by mating heterozygote lacZ positive males with 
standard C57BL/6J female mice (Taconic, Germantown, NY). Adult Ptcl-lacZ mice 
were fixed by cardiac perfusion followed by drop fixation of heart or vascular tissues for 
1-2 hours in 0.2% gluteraldehyde, 5mM EDTA, 2mM MgCb, O.IM sodium phosphate, 

20 pH8. Pup tissues and small tissues were directly drop fixed in gluteraldehyde for 1-2 
hours. Following fixation, the tissues were washed 3 times for 20-30 min in 2mM 
MgCl2, 0.01 deoxycholate, 0.02% NP40, 50mM sodium phosphate pH8. The tissues 
were then stained overnight at 37*^0 in Img/ml 5-Bromo-4-chloro-3-indolyl -D- 
galactopyranoside PCgal) (Sigma, St. Loiiis, MO), 5 mM potassium ferricyanide, 5mM 

25 potassium ferrocyanide, 2mM MgCh, 0.01 % deoxycholate, 0.02% NP40, 50mM 
sodium phosphate pH8. The tissues were visualized either as whole mounts or 
embedded in paraffin and prepared as light eosin-stained 5 micron sections. 

Patched 1 is expressed in the endothelial cells of the aorta, some vascular smooth 
muscle cells (vSMC) and adventitial fibroblasts of the aorta (photomicrographs not 

30 presented here). In addition, coronary vasculature and cardiomyocytes of the atria and 
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ventricles also express ptcl. These expression patterns suggest that cells in nonnal 
vascular and cardiovascular tissues may be responsive to or responding to hedgehog. 

Normal Vasculature and Cardiovascular Tissues are Hedgehog Respoftsive 

5 We determined that normal vascular and cardiovascular tissues are indeed 

responsive to exogenous hedgehog administration by injecting Ptcl-lacZ mice 
systemically with hedgehog. PtcHacZ mice were injected daily subcutaneously with the 
indicated amoimts of polyethylene glycol 20,000-conjugated A192C sonic hedgehog n- 
terminal protein (PEG-Shh) (Pepinsky et al, 2000) or its vehicle (PBS). This form of the 

10 protein also contains a mutation of the n-terminal cysteine residue to isoleucine- 
isoleucine which significantly improves the specific activity of hedgehog protein 
(Pepinsky et al, 1998; Taylor et al, in prep). 

Mice injected with hedgehog protein for 3 days showed no obvious physical or 
behavioural differences compared to vehicle-treated or imtreated littermates. 

15 Specifically, PtcHacZ mice were injected (s.c.) once daily with PEG-Shh for 3 days 
starting at postnatal day 6 then sacrificed at postnatal day 9; selected organs were 
dissected and whole mount stained by X-Gal histochemistry. Mice were treated with 
vehicle, 3mg/kg PEG-Shh or 6mg/kg PEG-Shh for 3 days and were sacrificed on the 
fourth day. Vascular and cardiovascular tissues were dissected and whole-mount stained 

20 with Xgal. The vascular and cardiovascular staining pattern for ptcl seen in normal 
animals intensifies significantly in animals injected with increasing doses of hedgehog 
protein (data not presented here). Whole mount Xgal staining of the coronary arteries, 
atria and ventricles are increased in a dose dependent maimer in the hearts and in the 
aortic wall of the RcUacZ mice injected with hedgehog. In contrast, wild type 

25 littermate mice injected vsdth the highest dose of hedgehog (6mg/kg) show no staining 
suggesting that the staining seen in the RcI-lacZ animals is not due to endogenous 
betagalactosidase. Histological sections of these tissues show that the lacZ positive cells 
in the Ptcl-lacZ mice treated with hedgehog are similar to those which are positive in the 
vehicle-injected group and in normal adtilt hearts and aortas from untreated animals. 

30 Though the same type of cells appear to stain with Xgal in the treated animals, there 
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appears to be an increase in the number of these cells especially in the adventitia. These 
data show that systemic administration of hedgehog can induce ptcl upregulation and 
indicate that these vascular tissues are responsive to hedgehog protein. 

Example 2: Hedgehog Induces Neovascularization in Matrigel Plug Model of 
Angiogenesis 

Hedgehog was also found to induce angiogenesis in the subcutaneous matrigel 
plug assay (Passaniti et al., 1992). Doses of 2 to lOug/ml of octyl, myr, PEG n or II-Fc 
fusion forms of human recombinant Shh were prepared in 0.5ml of matrigel containing 
40 lU/ml of heparin and injected subcutaneously into C57BL6 mice (3-5mo. old, 5 
mice/treatment group). The mice were sacrificed between 6-7 days after injection and 
the matrigel plug was dissected for visual inspection and histological analysis. Plugs 
containing hedgehog induced significant angiogenesis in the plug and surrounding tissue 
in 4 of 6 plugs at 2ug/ml and 5 of 6 plugs at lOug/ml whereas only 2 of 9 vehicle 
containing plugs showed any evidence of angiogenesis (data not presented here). 
Recombinant human bFGF, a known angiogenic protein, also showed significant 
hemoglobin content in 3 of 5 implants (data not shown). The results of the matrigel plug 
support the finding that hedgehog can induce angiogenesis in vivo. 

Example 3: Hedgehog Induces Neovascularization in Corneal Model of 
Angiogenesis 

The mouse cornea is avascular and can be used to demonstrate angiogenic 
activity by measuring the amount of vessel growth into this avascular tissue after 
surgical placement of a polymer pellet containing an angiogenic substance or growth 
&ctor into the comea (Kenyon et al., 1996; Asahara et al., 1997). To confirm the 
angiogenic activity of hedgehog in another well accepted model of angiogenesis, we 
tested the ability of hedgehog protein to induce neovascularization in the mouse corneal 
model pocket model of angiogenesis. 

Animals were anesthetized by pCTtobarbital intraperitoneal injection (160 
mg/kg). Corneal pockets were created in the eyes of each mouse and a 0.34 X 0.34 tnm 
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sucrose albumin sul&te ^ukh Meditec, Vaerlose, DK) pellet coated with hydron 
polymer type NCC (Interferon Sciences, New Brunswick, NJ) containing 1 of the agents 
indicated below was implanted into the corneal pocket CS7BL/£J mice were divided 
into 5 groups: control buffer alone; VEGF 300 ng/pellet; Myr-Shh vehicle alone; Myr- 

5 Shh 1.5 microg/pellet 39; Myr-Shh+VEGF (1.5 microg/pellet +300 ng/pellet, 
respectively). Pellets were positioned 1.0 mm from the corneal limbus, and 
erythromycin ophthalmic ointment (E. Fourera) was applied to each operated eye. The 
comeas of all mice were routinely examined by slit-lamp biomicroscopy on 
postoperative day 6 after pellet implantation. 

10 On the same day vessel length and comeal circumferential neovascularity (in 

degrees) were measured. After completing these mesisurements, C57BL/6J mice 
received an intravenous injection of 500 pg of BS-1 lectin FITC-conjugated (Vector 
Laboratories, Burlingame, CA). Thirty minutes later, the animals were sacrificed. The 
eyes were enucleated and fixed in 1 % paraformaldehyde solution. After fixation, the 

IS comeas were placed on glass slides and examined by fluorescence microscopy. Several 
C57BL/6J mice in each group did not receive BS-1 lectin injection; instead, the eyes 
were excised and fixed in 100% methanol solution for immunohistochemical staining. 

There was significant neovascular growth in the Shh and in the VEGF groups 
but not the vehicle-containing pellet groups. There was a striking qualitative difference 

20 in the appearance of vessels induced by hedgehog compared to VEGF 
(photomicrographs not presented here). VEGF induced a fine mesh of capillaries which 
axe short tortous sprouts from the extended branches of the preexisting limbus vessels at 
the base of the eye. In contrast, hedgehog induced much larger vessels which extended 
all the way to the pellet and contained numerous anastamoses between the venous and 

25 arterial circxilation. Histological analysis confirmed that hedgehog induced larger 
diameter vessels than VEGF. Hedgehog induced vessels often were filled with red blood 
cells whereas VEGF induced vessels had few or no red blood cells. 

Measurements (meaniistandard error of the mean) of the VEGF and hedgehog 
vessels confirmed that hedgehog-induced vessel diameters (mean 33±17um) were 

30 significantly larger than VEGF vessel diameters (mean 8±3um) (p<0.0001)). The 
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maximum vessel lengths induced by hedgehog (1020+200iim) were also significanfly 
greater than the maximum length of vessels induced by VEGF (700±70um) (p<0.0001). 
The density of vessels induced by hedgehog was slightly lower than the density of 
vessels in the corneal tissue exposed to VEGF as may be expected from the large 
5 number of small capillaries formed by VEGF (p<0.0001). All group differences were 
analysed by ANOVA and differences with p<0,05 were considered statistically 
significant. 

In summary, neovascularization induced by Shh was characterized by a 
statistically significant increase in vessel length, circuxnferential neovascularity and 

10 diameter of the Ixmiens; the mean number of vascular lumens per cross section was 
higher in the VEGF-treated corneas. Neovascularization induced by Shh+VEGF showed 
a large variability in the lumen diamet^ of these vessels ranging from small capillaries 
(6-7 gm) to large diameter vessels (80 gm). The combination of VEGF and Shh appears 
to create a composite of characteristics of both VEGF and Shh neovascular growth. 

IS These results confirm hedgehog protem can induce angiogenesis in vivo and suggest 
that hedgehog either alone or in combination with VEGF or other angiogenic growth 
factors such as bFGF, the angiopoietins and TWEAK [Lynch CN, Wang YC, Lund JK, 
Chen YW, Leal JA, Wiley SR. TWEAK induces angiogenesis and proliferation of 
endothelial cells. J Biol Chem, 1999 Mar 26;274(13):8455-9] can have therapeutic 

20 utility by inducing fimctional neovasculature. 

Example 4: Biological Activities Induced By Hedgehog -Responsive Mesencliymal 
Cells 

Hedgehog induces stromal fibroblasts and VEGF upregulation in the corneal model of 
25 angiogenesis 

To determine the mechanism by which Shh induces angiogenesis both Shh and 
VEGF-stimulated comeas (see Example 3) were excised and X-gal stained as described 
in Example 1 after fixation of the whole eye for 1 hour in 1 % paraformaldehyde 
followed by enucleation and fixation of the corneal hemisphere in 1 % 
30 paraformaldehyde for 30 minutes. VEGF-induced comeas did not stain with X-gal, 
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indicating that VEGF does not induce Ptc 1 expression during neovascularization. In 
contrast, strong X-gal staining was detected in the neovascular regions of PtcUacZ 
corneas treated vsdth Shh (data not presented here). Histologic analysis following 
parafBn embedding of X-gal-stained corneas and preparation of immunostained Sum 

5 sections with showed that the X-gal positive cells were not endothelial cells or smooth 
muscle cells, but fibroblasts surrounding tiie neovessels. Endothelial cell 
inimunostaining was done with a rat monoclonal antibody against mouse CD-31 
(Phannigen, San Diego, CA) followed by a biotinylated goat anti-rat immunoglobulin 
secondary antibody. Smooth muscle cells and pericytes were identified with a mouse 

10 monoclonal antibody against SM a-actin conjugated with alkaline-phosphatase (Sigma, 
St. Louis, MO) and fibroblasts were identified using an anti-vimentin antibody (Sigma, 
St Louis, MO). 

We then immunostained the Shh-induced corneas with a rabbit polyclonal anti- 
VEGF antibody (Santa Cruz Biotechnology, Santa Cruz, CA) with a biotinylated goat 
15 anti-rabbit immunoglobulin as secondary antibody. The results show that VEGF protein 
is in the fibroblasts and matrix immediately adjacent to the neovascular area. These 
results suggested that hedgehog may induce resident fibroblasts in the cornea to produce 
angiogenic factors such as VEGF. 

20 Fibroblasts in vitro respond to hedgehog stimulation by upregulation of Ptcl and 
angiogenic growth factors 

To detemiine if hedgehog can directiy induce fibroblasts to produce VEGF or 
other angiogenic factors , we treated normal human fibroblasts (CCD37) with Myr-Shh 
and the ability of fibroblasts to respond was evaluated by competitive RT-PCR for ptc 1 

2S and several angiogenic growth &ctors. Total RNA was prepared firom cells treated as 
described above using Trizol (Life Technologies, Rockville, MD). Four micrograms of 
total RNA was used to prepare cDNA using the SuperScriptTM preamplification system 
(Cat No. 18089-011, Life Technologies, Rockville, MD). The PGR reaction using 
buffer reagents firom the SuperSaiptTM preamplification system (Life Technologies, 

30 Rockville, MD) was quantitated with 20S rKNA competitive primers (Ambion). Primers 
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for the amplification of Ptc 1' were 5'-TCAGGATGCATTTGACAGTGACTGG-3' 
(SEQ ID NO: 38) and 5*- ACTCCGAGTCGGAGGAATCAGACCC-3' (SEQ ID NO: 
39) which are based on ptcl cDNA sequence (GenBank Accession Number U46155), 
All amplification for Rcl w^e done with 25 cycles of 94°C for 30 sec; 55**C for 1 min; 

5 72°C for 1 min. The cDNA from the same cells was also used as a template for VEGF, 
bFGF, Angiopoietin 1, and Angiopoietin n amplificadojo. The following primer pairs 
and PGR cycles were used: VEGF: 5'CGAAGTGGTGAAGTTCATGGATG3' (SEQ ID 
NO: 40) and 5'TTCTGTATCAGTCmCCTGGTGAG3' (SEQ ID NO: 41) which are 
based on the human VEGF cDNA sequence (GenBank Accession Number El 5 157). 

10 VEGF product was amplified with 30 cycles of 94°C for 30 sec; 62^C for 1 min; 72°C 
for hnin; bFGF: 5TACAACTrCAAGCAGAAGAG3* (SEQ ID NO: 42) and 
5'CAGCTCTTAGCAGACATTGG3' (SEQ ID NO: 43) which is based on the human 
bFGF cDNA sequence (GenBank Accession Number M27968). bFGF product was 
amplified with with 25 cycles of 94^C for 30 sec; 62^C for 1 min; 72*^0 for 1 min; 

15 Angiopoietin I 5*CAACACAAACGCTCTGCAGAGAGA3' (SEQ ID NO: 44) and 
5'CTCCAGTTGCTGCTTCTGAAGGAC 3' (SEQ ID NO: 45) which is based on human 
Angiopoietin I cDNA sequence (GenBank Accession Nimiber U83508), Angiopoietin I 
product was amplified with 25 cycles of 94°C for 30 sec; 64*^C for 90 sec; Angiopoietin 
n: 5'AGCGACGTGAGGATGGCAGCGTT3' (SEQ ID N0:46) and 

20 5'ATTTCCTGGTTGGCTGATGCTGCTT3' (SEQ ID NO: 47) which are based on 
human Angiopoietin II cDNA sequence (GenBank Accesion Nmnber AB009865). 
Angiopoietin n product was amplified with with 32 cycles of 94*^C for 30 sec; 64°C for 
90 sec. As internal control for sample preparation, gel loading, and random variations in 
RT-PCR, 18S rRNA primers and 18S rRNA competimers (Ambion, Austin, TX), used 

25 to modify 18S cDNA amplification efficiency, were included in each PGR reaction with 
target gene-specific primers. The linear range of amplification and optimal IBS 
primer/Competimer ratio was determined for each target gene following tiie 
manufacturer's recommendations (Ambion, Austin, TX). 

A time course of Shh induction shows that human fibroblast respond to Shh by 

30 iq>regulating tiie Ptcl gene (data not shown) indicating that these cells can respond via 

-121- 



wo 01/98344 



PCT/USOl/19435 



fhe known Hh signalling pathway. Neither hmnan umbilical vein and microvascular 
endothelial cells respond to Hh (data not shown). 

We next found that Hh can upregulate fibroblast expression of angiogenic 
growth factors, including VEGF, bFGF, Ang-1, and Ang-2 (data not shown). VEGF 

5 mRNA from human fibroblasts was significantly increased by Shh: all the three VEGF 
isofonns (VEGF121, 165, and 189) were strongly upregulated. VEGF 121, 165, and 189 
upregulation began at 12 hours and was maximal after 48 hours of incubation of the 
cells with Shh. No bFGF upregulation was detectable at any time-points. Moreover, 
quantitative RT-PCR for Ang-1 and Ang-2 showed upregulation of both genes, with 

10 maximal increase after 36 hours stimulation. To show that the upregulation of VEGF 
mRNA correlated with an increase in protein production, the concentration of VEGF165 
in cell media was measured by ELISA. Cells were stimulated with recombinant human 
myristolated Shh protein as described above. At harvest, the cell conditioned media was 
collected, centrifuged to remove cell debris (15 minutes at ISOOxg) and production of 

15 VEGF 165 protein was evaluated by using an EUSA kit (Quantikine human VEGF, 
R&D Systems, Minneapolis, MN). Total VEGF protein level underwent a progressive 
increase following Hh stimulation and a significant upregulation in the VEGF 
production was detectable at 72 hours (data not shown). 

20 Smooth muscle cells tq?regulate ptcl and are induced to proliferate in vitro in response 
to hedgehog 

We found that smooth muscle cells can also respond to Hh proteins in vitro. 
Eighty five percent confluent monolayers of vascular smooth muscle cells (PAC 1) were 
induced for 2 days with lug/ml of myrShh or an equivalent volimie of vehicle in normal 

25 media (M 199 complete media with 10% fetal bovine serum). For comparison, primary 
normal human lung fibroblaste and normal prostate stromal cells were grown in 
complete FBM and similarly stimulated (Clonetics/Bio-Whittaker, WalkersviUe, MD). 
The cells w^ harvested and RNA fix3m the cells was prepared and analysed by RT- 
PCR as above. All of these cells showed increased ptc 1 expression following induction 

30 with myrShh, but not myrShh vehicle alone suggesting that each of these cell types are 
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responsive to hedgehog (data not shown). In addition, hedgehog protein induced DNA 
synthesis in quiescent vascular SMCs and human fibroblasts. PAC-1 (Rothman et al., 
1992), WKY O-emire et al., 1994), primary pulmonary artery SMCs or aortic SMCs 
(Clonetics/Bio-Whittaker, Walkersville, MD) were plated (5xl03/well) m 96 well plates 

5 and allowed to adhere for 2-3 hours in O.lSml of complete media (M 199 with 10% fetal 
bovine serum for PAC 1 cells, DMEM with 10% fetal bovine serum for WKY cells or 
smGM-2 for primary human pulmonary artery or aortic SMCs). The cells were then 
starved for 18- 24 hours in complete media with 0.5% fetal bovine serum. Quiescent 
cells were stimulated with 0.1 to 40ug/ml of Hh proteins in 0.2ml starvation media for 

10 48 hours after vMch the cells were pulse labeled with 4.5uCi/ml 3H-thymidine 
(Amersham, ) for 4-8 hours at 37°C. The media was then removed, the cells washed 
with PBS then trypsinized. 3H-thymidine uptake into cells was determined by 
scintillation counting using a 1205 Betaplate counter (Wallac, Gaithersburg, MD). 
Vascular SMCs showed increased 3H-thymidine uptake 3 to 4-fold when induced by 

15 either myrShh (myristylated Sonic hedgehog) Dhh or basicFGF (obtained from Upstate 
Biotechnology, Lake Placid, NY). 

These resxilts show that both SMCs and fibroblasts respond to hedgehog. 
Although no smooth muscle cells were found in the hedgehog-stimulated comeas (see 
Example 1 and 4), the responsiveness of SMCs to Hh in vitro correlates well to normal 

20 ptcl expression and increased ptcl in the response by normal vascular SMCs to 
systemically administered Hh protein (See Example 3). 

Example 5: Hedgehog Improves Recovery from Ischemic Limb Injuiy 

Peripheral vascular disease caused by atherosclerosis and/or diabetes can be 
25 modeled in rodents and rabbits by surgical Ugation of the femoral artery and removal of 
a segment of tiie artery distal to the ligation (Takeshita et al., 1994 and 1996; Rivard et 
al., 1999; CoufSnhal et al., 1999). The limb ischemia produced by the ligation also 
results in limb neuropathy (Schratzberger et al., 2000). Ischemic injury of healthy 
animals and humans activates a number of pathways A^ch subsequently induce the 
30 regeneration and recovery of the damaged tissue. For example, VEGF is induced in 
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response to hindlimb ischemia and can accelerate recovery when given 
pharmacologically following this ischemic insult (Schratzberger et al., 2000). We 
investigated the possibility that the hedgehog pathway is activated in response to limb 
ischemia in normal animals and is beneficial both in the endogenous and 

5 pharmacological settings to revascularization and recovery fi'om ischemic neuropathy. 

The expression of ptcl following hindlimb ischemia was investigated in 3-4 
month old Ptel-lacZ mice (Rivard et al., 1999). The mice were anesthetized with 
pentobarbital (1 60mg/kg i.p.) and an incision was made in the skin overlying the middle 
portion of the left hindlimb. Both the proximal end of the femoral artery and the distal 

10 portion of the saphenous artery were ligated and the artery and all side branches were 
dissected free and excised. The skin was closed with a surgical stapler and the animals 
were allowed to recover. The mice were either left \mtreated or injected daily or every 
other day i,m. in the ischemic limb with hng/kg of E-Shh/mouse IgGI Fc ftision protein. 
Seven days after induction of ischemia, the animals were sacrificed and the iq)per 

15 hindlimb was isolated and whole mount stained with Xgal. Comparison of the 
contralateral upper hindlimbs (right) to the ischemic hindlimbs (left) shows a significant 
upregulation of ptcl expression (data not shown). Ischemia alone induced upregulation 
of ptcl expression in the ischemic limb and increasing frequency of hedgehog injection 
ftirther increased ptcl expression in the ischemic limb muscle. Histological sections of 

20 the ischemic and control hindlimb muscle showed muscle fiber degeneration and edema 
in the ischemic versus nonischemic tissue (data not shown). In addition, the ischemic 
muscle has a number of ptcl-e?q>ressing QCgal-stained) stromal cells in the interstitial 
areas between the muscle fibers. These cells which appear to be responding to hedgehog 
were shown to be fibroblasts identified by costaining with vimentin and X-gal or 

25 monocytes/macrophages identified by costaining with the moma2 antibody and X-gal 
(see Example 4 for Methods). These results show that the hedgehog pathway may be 
part of the normal response to ischemia which may be augmented by pharmacological 
administration of hedgehog protein. 

The relevance of hedgehog upregulation following ischemia is determined by 

30 inhibiting hedgehog action with a blocking antibody to hedgehog. Unilateral hindlimb 
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ischemia was induced in nonnal mice (C57BL6, 3-4montiis of age, female). The mice 
are treated with lOmg/kg daily 3 days prior to ischemia and 2.5-5mg/kg every 3 days 
following ischemia for 3 weeks with either the blocking antibody to hedgehog (5EI, 
Developmental Studies Hybiidoma Bank, Karen Jensen, Department of Biological 

5 Sciences, The University of Iowa, 007 Biology Building East, Iowa City, lA 52242, tel; 
(319)335-3826, fax: (319)335-2077, 5E1 available for order on website: 
www.uiowa.edu/-dshbwww/l*ndex.html, e-mail: dshb@iuiowa.edu) or an isotype 
matched control mouse monoclonal antibody. 

The vascular perfusion of the ischemic vs contralateral limb is assessed at days 

10 4, 7, 14, 21 and 28 days by lasar doppler (Lisca, Inc. laser Doppler perfusion imager 
system) (Rivard et al., 1999). Nerve vascular perfusion is determined by exposing the 
sciatic nerve and scanning the nerve surfece area using lasar doppler or by injection of 
Fluoresceinated-BS 1 lectin (Vector Laboratories, Burlingame, CA) 30 minutes prior to 
sacrifice and visualizing the vaso nervorum by w^ole mount fluorescence microscopy 

15 postmortem (as described above). Vascular density is assessed at these times by 
histological staining for CD31 positive vasculature in sections (anti-murine CD31, 
Phanningen, San Diego, CA) (Rivard et al., 1999). Neuropathy is assessed at these tune 
points by nerve conduction measurements of the sciatic/peroneal nerves using standard 
orthodromic surface recording techniques and a Teca TD-10 portable recording system 

20 (Oxford Instruments, Concord, MA). Angiogenesis as measured both by vascular 
perfusion or vascular density is decreased in ischemic limbs of animals treated with 
hedgehog blocking antibody, SEl, compared to ones treated with the isotype matched 
control, 1E6. Nerve conduction measurements are also decreased in SBl-treated mice 
compared to control antibody-treated mice. Finally, nerve vascular perfusion is 

25 decreased in the SEl -treated mice. These results suggest that the upregulation of the 
hedgehog pathway following ischemia is a beneficial compensatory response to 
ischCTdic injury. 

The utility of treating ischemia by activating the hedgehog pathway is tested in 
aged mice (>2yrs old) or apoE null mice with surgically induced limb ischemia since 
30 these mice are deficient in their repair and regeneration processes following limb 
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ischemia. These mice are made ischemic then injected (i.v., i.p., s.c. or i.mO with doses 
ranging j&om 0-lOmg/kg of hedgehog protein or eqmvalent vohmies of vehicle control or 
control protein beginning on the day of surgery and with a frequency of daily to 3 times 
per week. The vascular perfusion, vascular density and neuronal conduction and 

5 neuronal vascularity (vaso nervorum) of the ischemic vs contralateral limb are assessed 
at days 4, 7, 14, 21 and 28 postsurgery as described above. The results show that 
hedgehog-treated animals show significant improvements in vascular perfusion, 
vascular density as well as motor nerve conduction and their vaso nervorum compared 
to control treated animals (data not presented). 

10 Hedgehog can also be delivered using gene tiierapy. Either full length or soluble 

Nterminal Shh adenovirus (10^ to 101^^ particles) is injected i.m. at day 1 postinjury in 
the inguinal area of Ihe upper hindlimb following surgery. Alternatively, the full length 
or soluble n-terminal Shh adenoassociated virus (AAV) or a control LacZ AAV is 
administered 4 weeks prior to surgery. Similar doses of adenovirus containing full 

15 length or n-terminal Shh or LacZ containing control adenovirus can be administered in 
place of AAV-Shh. Above endpoints for vascular and motor neuron conduction 
improvements are also seen with viral gene therapy. 

Together these results show that the hedgehog pathway is a crucial component of 
the normal angiogenic response, tissue regeneration and recovery fix>m ischemia injury 

20 and that hedgehog proteins can induce angiogenesis and improve recovery from 
ischemia when used pharmacologically. 

Example 6: Hedgehog Induces Collateral Vessel formation and Improved 
Myocardial Function following Surgically Induced Myocardial Ischemia 
25 Chronic myocardial ischemia and collateral vessel formation can be modeled in 

, pigs through the placement of an ameroid constrictor around the left circumflex 
coronary artery. Treatment of these ischemic hearts with angiogenic proteins can 
increase myocardial vascularity, perfusion and function in the ischemic area as well as 
overall heart function. We determine that hedgehog protein or gene therapy can also 
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improve fhese measures of cardiac perfusion, viability and function follovdng ischemia 
in the following experiments. 

Ameroid constrictors are placed around the left circumflex coronary artery 
(LCX) of anesthetized Yorkshire pigs (5-6 weeks old, 15-1 8kg, male or female) (Laham 
et al,, 2000; Harada et al., 1994; Unger et al., 1994). The animals are allowed to recover 
for 3 weeks to allow time for ameroid closure. Either immediately after or 3 weeks post- 
ameroid placement, the animals are randomized into one of several groups (10 
animals/group). Hedgehog or control is administered by one of the following routes: 

1 . direct injection of ischemic myocardium with hedgehog or saline 

2. intrapericardial administration of hedgehog protein or saline 

3. systemic administration of hedgehog protein or saline (s.c, i.m. or i.v. injection) 

4. myocardial injection of hedgehog in (0.1 -5mg) heparin or heparin alone following 
thoracotomy or via an injection catheter (Cordis- Webster) 

5. intrapericardial injection of hedgehog in (0.1-5mg) heparin 

6. intracoronary catheter delivery device 

7. viral gene therapy via above methods using 10^-10^^ Particles of full length or n- 
teraiinal Shh adenovirus in a single or several bolus injections (O.hnl-hnl/injection). 
Heart muscle per&ision and function are monitored using several techniques 
immediately prior to the Hedgehog treatments and 2-4 weeks post-Hedgehog treatments. 
Coronary perfusion was determined by right and left coronary angiography. 

To obtain a collateral index, left to left and right to left coronary collaterals are 
measured. Regional resting myocardial blood flow is measured using colored 
microspheres. Magnetic resonance imaging of wall thickening is used to determine 
global ventricular, ischemic/normal regional function and myocardial perfiisioiL 
Electromechanical left ventricular mapping is done using the NOGA system (Biosense, 
Johnson&Johnson, Warren, NJ) to determine localized heart function (Vale et al., 1999, 
Komowski, Hong and Leon, 1998). In addition, complete autopsies and histopathology 
is done on each animal for coronary tissues (pericardium, epicardial coronary artery, 
myocardium in the left anterior descending artery distribution (normal tissue), left 
circiunflex artery distribution, (ischemic tissue) and peripheral organs (gastrointestinal 
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tract, lung, liver, kidney, bone, bone marrow)). Improvements in heart muscle perfusion 
and function as well as histological analysis of coronary tissue vascularization are 
assessed. Hedgehog treatments can show improvement in these parameters when 
compared to control treatments suggesting therapeutic utility for hedgehog treatments in 
5 myocardial infarction and coronary artery disease. 

Example 7: Inhibition of Hedgehog (Anti-hedgehog blocking antibody) Decreases 
Tumor Growth Rate and/or Tumor Angiogenesis 

To determine if tumor cell lines overejqpress hedgehog protein, anti-hedgehog 

10 antibody was used to immunoprecipitate cell lysates of various tumor cell lines. We 
used gastrointestinal epithelial cell lines as an example: T84 (human colon epithelial 
carcinoma, CCL-284, ATCC, Manassas, VA); Caco2 and SW480 (Tnmian colon 
epithelial adenocarcinomas, HTB-37 and CCL-228, ATCC, Manassas, VA), Briefly, 
- one milligram amounts of cell lysis supematant were immimoprecipitated with either 

15 anti-hedgehog antibody, 5E1 (+) or an isotype matched control antibody, 9E10 (C). The 
immunoprecipitated samples were analysed by western blotting with an anti-hedgehog 
rabbit polyclonal antibody, rl200. 

More specifically, confluent monolayers of each cell line in T 150 flasks were 
lysed m 3mL of cold lysis buffer (1% Triton X-100, 0.5% sodium deoxycholate, 0.% 

20 SDS, 150mM NaCl, ImM sodium vanadate, 10% glycerol, lOmM Tris-HCL, pH 8.0) 
containing a 2x concentration of Complete protease inhibitor cocktail (Boehringer 
Mannheim, Indianapolis, IN). The lysate was rocked for 30' at 4°C then scraped into a 
microfuge tube and debris pelleted in a microfiige for 10\ The stqpematant was stored at 
-80*^C. Protein concentration of the siipematants were determined using Bio-Rad Protein 

25 Assay reagent and equivalent milligram amounts of supematant were used for each 
immunoprecipitadon. Each sample was gently agitated overnight at 4®C with 2.5 ug of 
either anti-hedgehog antibody, 5E1, or an isotype matched control antibody, 9E10 (anti- 
human c-myc, Calbiochem, San Diego, CA) (Fan et al., 1998). Protein A conjugated 
Sepharose beads (30 microliters packed beads/sample) were added to each sample and 

30 the samples were gently agitated at 4°C for 30-40 minutes. The beads and associated 
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immune complexes were then spun down in a microfuge for 10 seconds and washed 4 
times with 1 ml of ice cold lysis buffer. The buffer was then removed from the beads, 
reducing SDS-PAGB sample buffer was added, the samples were heated to 90^C for 5 
minutes then analyzed by SDS-PAGE (4-20% Tris-glycine gels, Novex, San Diego, 
CA). The proteins were transferred to nitrocellulose filters and western blot analysis was 
performed at room temperature. 

The nitrocellulose filters was incubated with blocking solution (5% dry milk in 
Tris-bufifered saline with 0.3% Tween-20) for 1 hour followed by blocking solution 
containing a 1:10,000 dilution of anti-hedgehog rabbit polyclonal, rl200, for 2-3 hours 
at room temperature or overnight incubation at 4^C. The nitrocellulose filters were 
washed 3 times with Tris-buffered saline with 0.3% Tween-20; incubated for 1 hour in 
1:5000 dilution of horseradish peroxidase-conjugated goat anti-rabbit antibody (Jackson • 
Immunoresearch) then visualized using ECL western blotting detection reagents 
(Amersham Pharmacia Biotech). 

Hedgehog protein is overexpressed in several human gastrointestinal tumor cell 
lines compared to nomoial human gastrointestinal epithelial cells or fibroblasts (data not 
shown). The anti-hedgehog antibody immunoprecipitations show a hedgehog rabbit 
polyclonal antibody-reactive band at 19kD, the expected molecular weight for hedgehog 
protein. Hie control antibody (9E10) immunoprecipitation shows no hedgehog 
polyclonal antibody-reactive band comigrating with hedgehog protein standard at 19kD. 
Normal gastrointestinal epithelial also express a low level of hedgehog protein, but 
normal gastrointestinal fibroblasts do not show any expression. None of the epithelial 
cell lines tested respond to hedgehog (data not shown), but the hedgehog produced by 
these tumor cells may activate angiogenesis via induction of stromal tissue in the tumor. 

The ability of hedgehog-blocking or hedgehog pathway-blocking reagents such 
as the anti-hedgehog blocking antibodies (SEl, ARG6, ALC9 or BH.E4) to inhibit 
tumor angiogenesis and tumor growth are determined in subcutaneously-implanted 
tumor models in athymic Swiss (Cr:NIH(S)-nu) or athymic random bred (NCr-nu) mice 
of a single sex (males >18g or females > 17g, all within a 4g weight range). Carcinoma 
cell lines of gastrointestinal origin such as SW480, HT29 or T84 are passaged in nude 
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mice as subcutaQeous tumors or are passsaged in culture as cell monolayers. Bither 2x 
10^ cells or tumor 20-40mg firagments of a passaged tumor are implanted 
subcutaneously in the axillary region of 6-10 athymic mice. Tumors were monitored 
ftequently for progressive growth. Treatments are initiated when individual tumors 

5 range between 100 mg - 700 mg. Mice are randomized into test and vehicle control 
groups and treated with either hedgehog blocking antibodies, control isotope-matched 
antibody, no treatraient or cisplatinum. Antibodies were administered (25-lOOmg/kg 
bolus i.p. injections) at a frequency of every day to 3 times a week for the follow-up 
period. Cisplatinum was administered subcutaneously three times a week ( 2 mg/kg). 

10 Body wei^ts and tumor measurements (width and length) are recorded at 3 - 5 day 
intervals following treatment for 7-21 days. Tumors are collected on the final day for 
histological analysis. Mean tumor weight change and/or mean vascular density are 
decreased in the hedgehog blocking antibody-treated group compared to the control 
antibody-treated group. In addition, hedgehog blocking antibodies may be administered 

IS prior to tumor implantation and tumor growth rate is monitored as described to 
determine if early tumor growth rates are decreased by blocking hedgehog signalling. 

Example 8: Productioit and Expression of HH-Ig fusions 
MATERIALS AND METHODS 

20 Construction of pUB55, expression plasmid for Sonic Hedgehog in Pichia pastoris: 

pUB55 contains the N-terminal domain of himian Sonic Hedgehog (SEQ ID 
NO: 21 in Table 4) with the alpha factor PrePro region as the secretion signal. pUB55 
was constructed in pCCM73, a derivative of pPIC9 (obtained from Invitrogen, San 
Diego, CA) with the Kanamycin gene (HincII-HiQcII fiagment) of pUC4-K inserted at 

25 the Sphl site of pPIC9. The human Sonic hedgehog coding sequence from Earl-Notl was 
obtained firom pEAG543 which has a stop codon and Not 1 site engineered following 
Glyl97 in the coding sequence. Plasmid pCCM73 was cut with Xhol and NotI and was 
Ugated with the Earl-Notl fiagment of pEAG543 (containing the Sonic Hedgehog 
coding sequence. Table 4) and oligonucleotides [5' TCG AGA AAA GAT GCG GAC 

30 CGG GCA GGG GGT 3': SEQ ID NO: 36 and 5' CGA ACC CCC TGC CCG GTC 
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CGC ATC TTT TC 3*: SEQ ID NO: 37] that fonn a Xhol-Earl fragmeaat and create the 
appropriate coding sequence for placing Sonic hedgehog adjacent to the alpha factor 
leader sequence in frame. 

5 Expression of Desert Hedgehog in Pichia pastoris and construction of KEX2 site 
mutations: 

The Desert Hedgehog coding region in plasmid pEAG680 was modified to 
incorporate a BsrGI and an Xmal site site using the Stratagene QuikChange mutagenesis 
kit 

10 

Expression of Indian Hedgehog in Pichia pastoris and construction of KEX2 site 
mutattions: 

Plasmid pEAG6S7 is pBluescript with the Indian Hedgehog coding sequence 
with a stop codon following codon GlyXXX. pEAG658 is pBluescript with the Indian 
IS Hedgehog coding sequence and a Sail site engineered within residues suitable for fusing 
the Indian Hedgehog coding sequence (SEQ ID NO: 22) with Fc immunoglobulin 
coding sequences (SEQ ID NOS: 28-30) at the hinge region of immunoglobulins. To 
&cilitate subsequent manipulations, Spel and Xmal sites were introduced to pEAG6S8 
by site-directed mute^enesis. 

20 

Table: DNA sequences of Hedgehog N-temiinal domains and Immunoglobulin Fc 
Regions: 

Protein DNA Sequence 

25 

Human Sonic Hedgehog N- TGCGGACCGGGCAGGGGGTTCGGGAAGAGGAGG 
termmal Domain CACCCCAAAAAGCTGACCCCTTTAGCCTACAAGC 
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[SEQEDNO: 21] 

5 



10 



15 



20 



25 



Human Indian Hedgehog 

30 
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AGTTTATCCCCAATGTGGCCGAGAAGACCCTAG 

GCGCCAGCGGAAGGTATGAAGGGAAGATCTCCA 

GAAACTCCGAGCGATTTAAGGAACTCACCCCCA 

ATTACAACCCCGACATCATATItAAGGATGAAG 

AAAACACCGGAGCGGACAGGCTGATGACTCAGA 

GGTGTAAGGACAAGTTGAACGCTTTGGCCATCTC 

GGTGATGAACCAGTGGCCAGGAGTGAAACTGCG 

GGTGACCGAGGGCTGGGACGAAGATGGCCACCA 

CTCAGAGGAGTCTCTGCACTACGAGGGCCGCGC 

AGTGGACATCACCACGTCTGACCGCGACCGCAG 

CAAGTACGGCATGCTGGCCCGCCTGGCGGTGGA 

GGCCGGCTTCGACTGGGTGTACTACGAGTCCAA 

GGCACATATCCACTGCTCGGTGAAAGCAGAGAA 

CTCGGTGGCGGCCAAATCGGGAGGC 

TGCGGGCCGGGTCGGGTGGTGGGCAQCCGCCGG 
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N-terminal Domain 



5 [SEQ ID NO: 22] 



10 



15 



20 



23 
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CGACCGCCACGCAAACTCGTGCCGCTCGCCTACA 

AGCAGTTCAGCCCCAATGTGCCCGAGAAGACCC 

TGGGCGCCAGCGGACGCTATGAAGGCAAGATCG 

CTCGCAGCTCCGAGCGCTTCAAGGAGCTCACCCC 

CAATTACAATCCAGACATCATCTTCAAGGACGA 

GGAGAACACAGGCGCCGACCGCCTCATGACCCA 

GCGCTGCAAGGACCGCCTGAACTCGCTGGCTATC 

TCGGTGATGAACCAGTGGCCCGGTGTGAAGCTG 

CGGGTGACCGAGGGCTGGGACGAGGACGGCCAC 

CACTCAGAGGAGTCCCTGCATTATGAGGGCCGC 

GCGGTGGACATCACCACATCAGACCGCGACCGC 

AATAAGTATGGACTGCTGGCGCGCTTGGCAGTG 

GAGGCCGGCTTTGACTGGGTGTATTACGAGTCAA 

AGGCCCACGTGCATTGCTCCGTCAAGTCCGAGCA 

CTCGGCCGCAGCCAAGACGGGCGGC 
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Human Desert Hedgehog TGCGGGCCGGCjCCGGGGGCCGGTTGGCCGGCGC 



N-tenninal Domain CGCTATGCGCGCAAGCAGCTCGTGCCGCTACTCT 

ACAAGCAATTTGTGCCCGGCGTGCCAGAGCGGA 
CCCTGGGCGCCAGTGGGCCAGCGGAGGGGAGGG 

[SEQ ID NO: 27] TGGCAAGGGGCTCCGAGCGCTTCCGGGACCTCG 

TGCCCAACTACAACCCCGACATCATCTTCAAGGA 



TGAGGAGAACAGTGGAGCCGACCGCCTGATGAC 

CGAGCGTTGTAAGGAGCGGGTGAACGCTTTGGC 

CATTGCCGTGATGAACATGTGGCCCGGAGTGCG 

CCTACGAGTGACTGAGGGCTGGGACGAGGACGG 

CCACCACGCTCAGGATTCACTCCACTACGAAGGC 

CGTGCTTTGGACATCACTACGTCTGACCGCGACC 

GCAACAAGTATGGGTTGCTGGCGCGCCTCGCAG 

TGGAAGCCGGCTTCGACTGGGTCTACTACGAGTC 

CCGCAACCACGTCCACGTGTCGGTCAAAGCTGAT 

AACTCACTGGCGGTCCGGGCGGGCGGC 

Fc region of human IgGI- GTCGACAAAACTCACACATGCCCACCGTGCCCA 

with Asn-Ghi glycosylation GCACCTGAACTCCTGGGGGGACCGTCAGTCTTCC 

site mutation TCTTCCCCCCAAAACCCAAGGACACCCTCATGAT 
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CTCCCGGACCCCTGAGGTCACATGCGTGGTGGTG 
GACGTGAGCCACGAAGACCCTGAGGTCAAGTTC 
[SEQ ID. NO: 28] AACTGGTACGTGGACGGCGTGGAGGTGCATAAT 

GCCAAGACAAAGCCGcg^aggagc£^;tacca^cacgtacc 
gtgtggTCAGCGTCCTCACCGTCCTGCACCAGGACT 
GGCTGAATGGCAAGGAGTACAAGTGCAAGGTCT 
CCAACAAAGCCCTCCCAGCCCCCATCGAGAAAA 
CCATCTCCAAAGCCAAAGGGCAGCCCCGAGAAC 
CACAGGTGTACACCCTGCCCCCATCCCGGGATGA 
GCTGACCAAGAACCAGGTCAGCCTGACCTGCCT 
GGTCAAAGGCTTCTATCCCAGCGACATCGCCGTG 
GAGTGGGAGAGCAATGGGCAGCCGGAGAACAA 
CTACAAGACCACGCCTCCCGTGTTGGACTCCGAC 
GGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGG 
ACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCT 
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CATGCTCCGTGATGCATGAGGCTCTGCACAACCA 
CTACACGCAGAAGAGCCTCTCCCTGTCTCCCGGG 
AAA 

Fc region of murine IgGl- GTCGACGTGCCCAGGGATTGTGGTTGTAAGCCTT 
with Asn-Gln glycosylation GCATATGTACAGTCCCAGAAGTATCATCTGTCTT 
site mutation CATCTTCCCCCCAAAGCCCAAGGATGTGCTCACC 

ATTACTCTGACTCCTAAGGTCACGTGTGTTGTGG 
TAGACATCAGCAAGGATGATCCCGAGGTCCAGT 
[SEQ ID NO: 29] TCAGCTGGTTTGTAGATGATGTGGAGGTGCACAC 

AGCTCAGACGCAACCaCGGGAaGAGCAGTTCCAA 
AGCACTTTCCGCTCAGTCAGTGAACTTCCCATCA 
TGCACCAGGACTGGCTCAATGGCAAGGAGTTCA 
AATGCAGGGTCAACAGTGCAGCTTTCCCTGCCCC 
CATCGAGAAAACCATCTCCAAAACCAAAGGCAG 
ACCGAAGGCTCCACAGGTGTACACCATTCCACCT 
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10 

Fc region of murine IgG2a- 
15 with Asn-Gln glycosylation 
site mutation 

20 

[SEQ ID NO: 30] 

25 



CCCAAGGAGCAGATGGCCAAGGATAAAGTCAGT 

CTGACCTGCATGATAACAGACTTCTTCCCTGAAG 

ACATTACTGTGGAGTGGCAGTGGAATGGGCAGC 

CAGCGGAGAACTACAAGAACACTCAGCCCATCA 

TGGACACAGATGGCTCTTACTTCGTCTACAGCAA 

GCTCAATGTGCAGAAGAGCAACTGGGAGGCAGG 

AAATACTTTCACCTGCTCTGTGTTACATGAGGGC 

CTGCACAACCACCATACTGAGAAGAGCCTCTCCC 

ACTCTCCTGGTAAA 

GTCGACCCCAGAGGGCCCACAATCAAGCCCTGT 

CCTCCATGCAAATGCCCAGCACCTAACCTCTTGG 

GTGGACCATCCGTCTTCATCTTCCCTCCAAAGAT 

CAAGGATGTACTCATGATCTCCCTGAGCCCCATA 

GTCACATGTGTGGTGGTGGATGTGAGCGAGGAT 

GACCCAGATGTCCAGATCAGCTGGTTTGTGAACA 

ACGTGGAAGTACACACAGCTCAGACACAAACCC 

ATAGAGAGGATTACCAAAGTACaCTtCGGGTGGT 

CAGTGCCCTCCCCATCCAGCACCAGGACTGGATG 
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AGTGGCAAGGAGTTCAAATGCAAGGTCAACAAC 

AAAGACCTCCCAGCGCCCATCGAGAGAACCATC 

TCAAAACCCAAAGGGTCAGTAAGAGCTCCAC AG 

GTATATGTCTTGCCTCCACCAGAAGAAGAGATG 

ACTAAGAAACAGGTCACTCTGACCTGCATGGTG 

ACAGACTTCATGCCTGAAGACATTTACGTGGAGT 

GGACCAACAACGGGAAAACAGAGCTAAACTACA 

AGAACACTGAACCAGTCCTGGACTCTGATGGTTC 

TTACTTCATGTACAGCAAGCTGAGAGTGGAAAA 

GAAGAACTGGGTGGAAAGAAATAGCTACTCCTG 

TTCAGTGGTCCACGAGGGTCTGCACAATCACCAC 

ACGACTAAGAGCTTCTCCCGGACTCCGGGTAAA 



Plasmid DNA sequence 
PUB55 

GATCTAACATCCAAAGACGAAAGGTTGAATGAAACCrmTGCCAT 
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CCGACATCCACAGGTCCATTCTCACACATAAGTGCCAAACGCA 

AC 

SEQID 

AGGAGGGGATACACTAGCAGCAGACCGTTGCAAACGCAGOACCTC 

N0:31 

CACTCCTCTTCTCCTCAACACCCACTTTTGCCATCGAAAAACCAGC 

CCAGTTATTGGGCTTGATTGGAGCTCGCTCATTCCAATTCCTTC 
TAT 

TAGGCTACTAACACCATGACTTTATTAGCCTGTCTATCCTGGCC 
CC 

CCTGGCGAGGTTCATGTTTGTITATTTCCGAATGCAACAAGCT 
CCG 

CATTACACCCGAACATCACTCCAGATGAGGGCTTTCTGAGTGT 
GGG 

GTCAAATAGTTTCATGTTCCCCAAATGGCCCAAAACTGACAGT 
TTA 

AACGCTGTCTTGGAACCTAATATGACAAAAGCGTGATCTCATC 
CAA 

GATGAACTAAGTTTGGTTCGTTGAAATGCTAACGGCCAGTTGG 
TCA 

AAAAGAAACTTCCAAAAGTCGCCATACCGTTTGTCTTGTTTGG 
TAT 

TGATTGACGAATGCTCAAAAATAATCTCATTAATGCTTAGCGC 
AGT 

CTCTCTATCGCTTCTGAACCCCGGTGCACCTGTGCCGAAACGC 
AAA 

TGGGGAAACACCCGCTTTTTGGATGATTATGCATTGTCTCCAC 
ATT 

GTATGCTTCCAAGATTCTGGTGGGAATACTGCTGATAGCCTAA 
CGT 
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TCATGATCAAAATTTAACTGTTCTAACCCCTACTTGACAGCAA 
TAT 

ATAAACAGAAGGAAGCTGCCCTGTCTTAAACCTITrmTrATC 
AT 

5 CATTATTAGCTTACTTTCATAATTGCGACTGGTTCCAATTGACA 

AG 

CTTTTGATTTTAACGACTTTTAACGACAACTTGAGAAGATGAA 
AAA 

ACAACTAATTATTCGAAGGATCCAAACGATGAGATTTCCTTCA 

10 ATT 

TTTACTGCAGTTTTATTCGCAGCATCCTCCGCATTAGCTGCTCC 

AGT 

CAACACTACAACAGAAGATGAAACGGCACAAATTCCGGCTGA 
AGC 

15 TGTCATCGGTTACTCAGATTTAGAAGGGGATTTCGATGTTGCT 

GTT 

TTGCCATnrCCAACAGCACAAATAACGGGTTATTGTTTATAA 
ATA 

CTACTATTGCCAGCATTGCTGCTAAAGAAGAAGGGGTATCTCT 

20 CGA 

GAAAAGATGCGGACCGGGCAGGGGGTTCGGGAAGAGGAGGC 

ACC 

CCAAAAAGCTGACCCCTTTAGCCTACAAGCAGTTTATCCCCAA 

TGT 

25 GGCCGAGAAGACCCTAGGCGCCAGCGGAAGGTATGAAGGGAA 

GA 

TCTCCAGAAACTCCGAGCGATTTAAGGAACTCACCCCCAATTA 
CAA 

CCCCGACATCATATTTAAGGATGAAGAAAACACCGGAGCGGA 
30 CAG 
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GCTGATGACTCAGAGGTGTAAGGACAAGTTGAACGCTTTQGCC 
AT 

CTCGGTGATGAACCAGTGGCCAGGAGTGAAACTGCGGGTGAC 
CGA 

5 GGGCTGGGACGAAGATGGCCACCACTCAGAGGAGTCTCTGCA 

CTA 

CGAGGGCCGCGCAGTGGACATCACCACGTCTGACCGCGACCO 
CAG 

CAAGTACGGCATGCTGGCCCGCCTGGCGGTGGAGGCCGGCTTC 
10 GA 

CTGGGTGTACTACGAGTCCAAGGCACATATCCACTGCTCGGTG 
AA 

AGCAGAGAACTCGGTGGCGGCCAAATCGGGAGGCTGATTCGC 
GGC 

15 CGCGAATTAATTCGCCTTAGACATGACTGTTCCTCAGTTCAAG 

TTG 

GGCACTTACGAGAAGACCGGTCTTGCTAGATTCTAATCAAGAG 
GA 

TGTCAGAATGCCATTTGCCTGAGAGATGCAGGCTTCATTTTTG 
20 ATA 

CTmrrATITGTAACCTATATAGTATAGGATTTTTTTTGTCATT 
TTG 

TTTCTTCTCGTACGAGCTTGCTCCTGATCAGCCTATCTCGCAGC 

TGA 

25 TGAATATCTTGTGGTAGGGGTTTGGGAAAATCATTCGAGTTTG 

ATG 

TmTCTTGGTATTTCCCACTCCTCTTCAGAGTACAGAAGATTA 
AGT 

GAGAAGTTCGTTTGTGCAAGCTTATCGATAAGCTTTAATGCGG 
30 TAG 
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TTTATCACAGTTAAATTGCTAACGCAGTCAGGCACCGTGTATG 
AAA 

TCTAACAATGCGCTCATCGTCATCCTCGGCACCGTCACCCTGG 
ATG 

CTGTAGGCATAGGCTTGGTTATGCCGGTACTGCCGGGCCTCTT 
GCG 

GGATATCGTCCATTCCGACAGCATCGCCAGTCACTATGGCGTG 
CTG 

CTAGCGCTATATGCGTTGATGCAATTTCTATGCGCACCCGTTCT 
CG 

GAGCACTGTCCGACCGCTTTGGCCGCCGCCCAGTCCTGCTCGC 
TTC 

GCTACTTGGAGCCACTATCGACTACGCGATCATGGCGACCACA 
CCC 

GTCCTGTGGATCTATCGAATCTAAATGTAAGTTAAAATCTCTA 
AAT 

AATTAAATAAGTCCCAGTTTCTCCATACGAACCTTAACAGCAT 
TGC 

GGTGAGCATCTAGACCTTCAACAGCAGCCAGATCCATCACTGC 
TTG 

GCCAATATGTTTCAGTCCCTCAGGAGTTACGTCTTGTGAAGTG 
ATG 

AACTTCTGGAAGGTTGCAGTGTTAACTCCGCTGTATTGACGGG 
CAT 

ATCCGTACGTTGGCAAAGTGTGGTTGGTACCGGAGGAGTAATC 
TCC 

ACAACTCTCTGGAGAGTAGGCACCAACAAACACAGATCCAGC 
GTG 

TTGTACTTGATCAACATAAGAAGAAGCATTCTCGATTTGCAGG 
ATC 
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AAGTGTTCAGGAGCGTACTGATTGGACATTTCCAAAGCCTGCT 

CGT 

AGGTTGCAACCGATAGGGTTGTAGAGTGTGCAATACACTTGCG 
TA 

5 ' CAATTTCAACCCTTGGCAACTGCACAGCTTGGTTGTGAACAGC 

ATC 

TTCAATTCTGGCAAGCTCCTTGTCTGTCATATCGACAGCCAAC 

AGA 

ATCACCTGGGAATCAATACCATGTTCAGCTTGAGCAGAAGGTC 
10 TG 

AGGCAACGAAATCTGGATCAGCGTATTTATCAGCAATAACTAG 
AA 

CTTCAGAAGGCCCAGCAGGCATGTCAATACTACACAGGGCTG 
ATG 

15 TGTCATTTTGAACCATCATCTTGGCAGCAGTAACGAACTGGTT 

TCC 

TGGACCAAATATTTTGTCACACTTAGGAACAGTTTCTGTTCCGT 
AA 

GCCATAGCAGCTACTGCCTGGGCGCCTCCTGCTAGCACGATAC 
20 ACT 

TAGCACCAACCTTGTGGGCAACGTAGATGACTTCTGGGGTAAG 
GG 

TACCATCCTTCTTAGGTGGAGATGCAAAAACAATTTCTTTGCA 
ACC 

25 AGCAACTTTGGCAGGAACACCCAGCATCAGGGAAGTGGAAGG 

CAG 

AATTGCGGTTCCACCAGGAATATAGAGGCCAACTTTCTCAATA 
GGT 

CITGCAAAACGAGAGCAGACTACACCAGGGCAAGTCTCAACT 
30 TGC 
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AACGTCTCCGTTAGTTGAGCTTCATGGAAnTCCTGACGTTATC 
TAT 

AGAGAGATCAATGGCTCTCTTAACGTTATCTGGCAATTGCATA 
AGT 

TCCTCTGGGAAAGGAGCITCTAACACAGGTGTCTTCAAAGCGA 
CTC 

CATCAAACTTGGCAGTTAGTTCTAAAAGGGCTTTGTCACCATT 
TTG 

ACGAACATTGTCGACAATTGGTTTGACTAATTCCATAATCTGT 
TCC 

GTTTTCTGGATAGGACGACGAAGGGCATCTTCAATTTCTTGTG 

AGG 

AGGCCTTAGAAACGTCAATTTTGCACAATTCAATACGACCTTC 
AGA 

AGGGACTTCTTTAGGTTTGGATTCTTCTTTAGGTTGTTCCTTGG 
TGT 

ATCCTGGCTTGGCATCTCCTTTCCTTCTAGTGACCTTTAGGGAC 

TTC 

ATATCCAGGTTTCTCTCCACCTCGTCCAACGTCACACCGTACTT 
GG 

CACATCTAACTAATGCAAAATAAAATAAGTCAGCACATTCCCA 
GG 

CTATATCTTCCTTGGATTTAGCTTCTGCAAGTTCATCAGCTTCC 
TCC 

CTAATTTTAGCGTTCAAACAAAACTTCGTCGTCAAATAACCGT 
TTG 

GTATAAGAACCTTCTGGAGCATTGCTCTTACGATCCCACAAGG 
TGC 

TTCCATGGCTCTAAGACCCTTTGATTGGCCAAAACAGGAAGTG 
CGT 
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TCCAAGTGACAGAAACCAACACCTGTTTGTTCAACCACAAATT 
TCA 

AGCAGTCTCCATCACAATCCAATTCGATACCCAGCAACTTTTG 
AGT 

TCGTCCAGATGTAGCACCTTTATACCACAAACCGTGACGACGA 
GAT 

TGGTAGACTCCAGTTTGTGTCCTTATAGCCTCCQGAATAGACT 
TTTT 

GGACGAGTACACCAGGCCCAACGAGTAATTAGAAGAGTCAGC 
CAC 

CAAAGTAGTGAATAGACCATCGGGGCGGTCAGTAGTCAAAGA 
CGC 

CAACAAAATTTCACTGACAGGGAACTTTTTGACATCTTCAGAA 
AGT 

TCGTATTCAGTAGTCAATTGCCGAGCATCAATAATGGGGATTA 
TAG 

CAGAAGCAACAGTGGAAGTCACATCTACCAACTTTGCGGTCTC 

AG 

AAAAAGCATAAACAGTTCTACTACCGCCATTAGTGAAACTTTT 
CAA 

ATCGCCCAGTGGAGAAGAAAAAGGCACAGCGATACTAGCATT 
AGC 

GGGCAAGGATGCAACTTTATCAACCAGGGTCCTATAGATAACC 
CT 

AGCGCCTGGGATCATCCTTTGGACAACTCTTTCTGCCAAATCT 
AGG 

TCCAAAATCACTTCATTGATACCATTATTGTACAACTTGAGCA 
AGT 

TGTCGATCAGCTCCTCAAATTGGTCCTCTGTAACGGATGACTC 
AAC 
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TTGCACATTAACTTGAAGCTCAGTCGATTGAGTOAACTTGATC 
AGG 

TTGTGCAGCrGGTCAGCAGCATAGGGAAACACGGCTTTTCCTA 
CCA 

AACTCAAGGAATTATCAAACTCTGCAACACTTGCGTATGCAGG 
TA 

GCAAGGGAAATGTCATACTTGAAGTCGGACAGTGAGTGTAGT 
CTT 

GAGAAATTCTGAAGCCGTATTTTTATTATCAGTGAGTCAGTCA 
TCA 

GGAGATCCTCTACGCCGGACGCATCGTGGCCGACCTGCAGGTC 
GG 

CATCACCGGCGCCACAGGTGCGGTTGCTGGCGCCTATATCGCC 
GAC 

ATCACCGATGGGGAAGATCGGGCTCGCCACTTCGGGCTCATGA 
GC 

GCTTGTTTCGGCGTGGGTATGGTGGCAGGCCCCGTGGCCGGGG 
GA 

CTGTTGGGCGCCATCTCCTTGGACCTGCAGGGGGGGGGGGGG 
AAA 

GCCACGTTGTGTCTCAAAATCTCTGATGTTACATTGCACAAGA 
TAA 

AAATATATCATCATGAACAATAAAACTGTCTGCTTACATAAAC 
AGT 

AATACAAGGGGTGTTATGAGCCATATTCAACGGGAAACGTCTT 
GC 

TCAAGGCCGCGATTAAATTCCAACATGGATGCTGATTTATATG 
GGT 

ATAAATGGGCTCGCGATAATGTCGGGCAATCAGGTGCGACAA 
TCT 
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ATCGATTGTATGGGAAGCCCGATGCGCCAGAGTTGTITCTGAA 
ACA 

TGGCAAAGGTAGCGTTGCCAATGATGTTACAGATGAGATGGTC 
AG 

5 ACTAAACTGGCTGACGGAA'TTTATGCCTCTTCCGACCATCAAG 

CAT 

TTTATCCGTACTCCTGATGATGCATGGTTACTCACCACTGCGAT 
CC 

CCGGGAAAACAGCATTCCAGGTATTAGAAGAATATCCTGATTC 
10 AG 

GTGAAAATATTGTTGATGCGCTGGCAGTGTTCCTGCGCCGGTT 

GCA 

TTCGATTCCTGTTTGTAATTGTCCTTTTAACAGCGATCGCGTAT 
TTC 

15 GTCTCGCTCAGGCGCAATCACGAATGAATAACGGTTTGGTTGA 

TGC 

GAGTGATTTTGATGACGAGCGTAATGGCTGGCCTGTTGAACAA 
GTC 

TGGAAAGAAATGCATAAGCTTTTGCCATTCTCACCGGATTCAG 
20 TCG 

TCACTCATGGTGATTTCTCACTTGATAACCTTATTTTTGACGAG 
GG 

GAAATTAATAGGTTGTATTGATGTTGGACGAGTCGGAATCGCA 
GA 

25 CCGATACCAGGATCTTGCCATCCTATGGAACTGCCTCGGTGAG 

TTT 

TCTCCTTCATTACAGAAACGGCTTTTTCAAAAATATGGTATTG 
ATA 

ATCCTGATATGAATAAATTGCAGTTTCATTrGATGCTCGATGA 
30 GTT 

-147- 



wo 01/98344 



PCT/USOl/19435 



TTTCTAATCAGAATTGGTTAATTGGTTGTAACACTGGCAGAGC 

ATT 

ACGCTGACTTGACGGGACGGCGGCTTTGTTGAATAAATCGAAC 
TTT 

TGCTGAGTTGAAGGATCAGATCACGCATCTTCCCGACAACGCA 
GA 

CCGTTCCGTGGCAAAGCAAAAGTTCAAAATCACCAACTGGTCC 
AC 

CTACAACAAAGCTCTCATCAACCGTGGCTCCCTCACTTTCTGGCTG 

GATGATGGGGCGATTCAGGCCTGGTATGAGTCAGCAACACCTT 
CTT 

CACGAGGCAGACCTCAGCGCCCCCCCCCCCCTGCAGGTCCCAC 
GG 

CGGCGGTGCTCAACGGCCTCAACCTACTACTGGGCTGCTTCCT 
AAT 

GCAGGAGTCGCATAAGGGAGAGCGTCGAGTATCTATGATTGG 
AAG 

TATGGGAATGGTGATACCCGCATTCTTCAGTGTCTTGAGGTCT 
CCT 

ATCAGATTATGCCCAACTAAAGCAACCGGAGGAGGAGATTTC 
ATG 

GTAAATTTCTCTGACTTTTGGTCATCAGTAGACTCGAACTGTG 

AGA 

CTATCTCGGTTATGACAGCAGAAATGTCCTTCTTGGAGACAGT 
AAA 

TGAAGTCCCACCAATAAAGAAATCCTTGTTATCAGGAACAAAC 
TTC 

TTGTTTCGAACITITrCGGTGCCTTGAACTATAAAATGTAGAGT 
GG 

-148- 



wo 01/98344 



PCTAJSOl/19435 



ATATGTCGGGTAGGAATGGAGCGGGCAAATGCTTACCTTCTGG 
AC 

CTTCAAGAGGTATGTAGGGTTTGTAGATACTGATGCCAACTTC 
AGT 

5 GACAACGTTGCTATTTCGTTCAAACCATTCCGAATCCAGAGAA 

ATC 

AAAGTTGTTTGTCTACTATTGATCCAAGCCAGTGCGGTCTTGA 
AAC 

TGACAATAGTGTGCTCGTGTTTTGAGGTCATCTTTGTATGAATA 
10 AA 

TCTAGTCTTTGATCTAAATAATCTTGACGAGCCAAGGCGATAA 
ATA 

CCCAAATCTAAAACTCTTTTAAAACGTTAAAAGGACAAGTATG 
TCT 

15 GCCTGTATTAAACCCCAAATCAGCTCGTAGTCTGATCCTCATC 

AAC 

TTGAGGGGCACTATCTTGTTTTAGAGAAATTTGCGGAGATGCG 
ATA 

TCGAGAAAAAGGTACGCTGATTTTAAACGTGAAATTTATCTCA 
20 AG 

ATCTCTGCCTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTG 
ACA 

CATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGAT 
GC 

25 CGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGG 

CGG 

GTGTCGGGGCGCAGCCATGACCCAGTCACGTAGCGATAGCGG 
AGT 

GTATACTGGCTTAACTATGCGGCATCAGAGCAGATTGTACTGA 
30 GA 
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GTGCACCATATGCGGTGTGAAATACCGCACAGATGCGTAAGG 
AGA 

AAATACCGCATCAGGCGCTCTTCCGCTTCCTCGCTCACTGACT 
CGC 

5 TGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTC 

AAA 

GGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGG 
AAA 

GAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAA 
10 AA 

AGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGA 
CGA 

GCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCC 
GAG 

15 AGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTC 

GTG 

CGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGC 
CTT 

TCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGT 
20 AGG 

TATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTG 
TGC 

ACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAA 
CTA 

25 TCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTG 

GCA 

GCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGC 
GGT 

GCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTA 
30 GA 
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AGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCT 
TCG 

GAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCG 
CTG 

GTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAG 
AAA 

AAAAGGATCTCAAGAAGATCCITTGATCTTTTCTACGGGGTCT 

GAC 

GCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGA 
GA 

TTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAA 

GTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAG 

TTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTA 

TTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTAC 

GATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACC 

GCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCA 

GCCAGGCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATC 

CGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGT 

AGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAG 

GCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTC 
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CGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGC 

AAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTA 

AGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAA 

TTCTCTTACTGTCATGCCATCCGTAA.GATGCTTTTCTGTGACTGGTG 

AGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGA 

GTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAG 

CAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGA 

AAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAAC 

CCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGC 

GTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAA 

GGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTT 

TTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCG 

GATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTC 

CGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCA 

TTATTATCATGACATTAACCTATAAAAATAGGCGTATCACGAGGCC 
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CTTTCGTCTTCAAGAATTAATTCTCATGTTTGACAGCTTATCATCGA 

TAAGCTGACTCATGTTGGTATTGTGAAATAGACGCAGATCGGGAA 
5 CACTGAAAAATAACAGTTATTATTCGAGATC 

pUB114 

GATCTAACATCCAAAGACGAAAGGTTGAATGAAACCTTTTTGCCAT 

10 CCGACATCCACAGGTCCATTCTCACACATAAGTGCCAAACGCAAC 

SEQID 

AGGAGGGGATACACTAGCAGCAGACCGTTGCAAACGCAGGAGCTC 

N0:32 

CACTCCTCTTCTCCTCAACACCCACITTTGCCATCGAAAAACCAGC 

IS 

CCAGTTATTGGGCTTGATTGGAGCTCGCTCATTCCAATTCCTTCTAT 

TAGGCTACTAACACCATGACTTTATTAGCCTGTCTATCCTGGCCCC 

20 CCTGGCGAGGTTCATGTTTGTTTATTTCXGAATGCAACAAGCTCCG 

CATTACACCCGAACATCACTCCAGATGAGGGCTTTCTGAGTGTGGG 

GTCAAATAGTTTCATGTTCCCCA(\ATGGCCCAAAACTGACAGTTTA 

AACGCTGTCTTGGAACCTAATATGACAAAAGCGTGATCTCATCCAA 

GATGAACTAAGTTTGGTTCGTTGAAATGCTAACGGCCAGTTGGTCA 

30 AAAAGAAACTTCCAAAAGTCGCCATACCGTTTGTCTTGTTTGGTAT 
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TGATTGACGAATGCTCAAAAATAATCTCATTAATGCTTAGCGCAGT 

CTCTCTATCGCTTCTGAACCCCGGTGCACCTGTGCCGAAACGCAAA 

TGGGGAAACACCCGCTTTTTGGATGATTATGCATTGTCTCCACATT 

GTATGCTTCCAAGATTCTGGTQGGAATACTGCTGATAGCCTAACGT 

TCATGATCAAAATTTAACTGTTCTAACCCCTACTTGACAGCAATAT 

ATAAACAGAAGGAAGCTGCCCTGTCTTAAACCTTTT"rrTTTATCAT 

CATTATTAGCTTACTTTCATAATTGCGACTGGTTCCAATTGACAAG 

CTTTTGATTTTAACGACTTTTAACGACAACTTGAGAAGATCAA 
AAA 

ACAACTAATTATTCGAAGGATCCAAACGATGAGATTTCCTTCA 
ATT 

TTTACTGCAGTTTTATTCGCAGCATCCTCCGCATTAGCTGCTCC 
AGT 

CAACACTACAACAGAAGATGAAACGGCACAAATTCCGGCTGA 
AGC 

TGTCATCGGTTACTCAGATTTAGAAGGGGATTTCGATGTTGCT 
GTT 

TTGCCATTTTCCAACAGCACAAATAACGGGTTATTGTTTATAA 
ATA 

CTACTATTGCCAGCATTGCTGCTAAAGAAGAAGGGGTATCTCT 
CGA 

GAAAAGATGCGGACCGGGCAGGGGGTTCGGGAAGAGGAGGC 
ACC 
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CCAAAAAGCTGACCCCTTTAGCCTACAAGCAGTTTATCCCCAA 
TGT 

GGCCGAGAAGACCCTAGGCGCCAGCGGAAGGTATGAAGGGAA 
GA 

TCTCCAGAAACTCCGAGCGATTTAAGGAACTCACCCCCAATTA 
CAA 

CCCCGACATCATATTTAAGGATGAAGAAAACACCGGAGCGGA 

CAG 

GCTGATGACTCAGAGGTGTAAGGACAAGTTGAACGCTTTGGCC 
AT 

CTCGGTGATGAACCAGTGGCCAGGAGTGAAACTGCGGGTGAC 
CGA 

GGGCTGGGACGAAGATGGCCACCACTCAGAGGAGTCTCTGCA 
CTA 

CGAGGGCCGCGCAGTGGACATCACCACGTCTGACCGCGACCG 
CAG 

CAAGTACGGCATGCTGGCCCGCCTGGCGGTGGAGGCCGGCTTC 
GA 

CTGGGTGTACTACGAGTCCAAGGCACATATCCACTGCTCGGTG 
AA 

AGCAGAGAACTCGGTGGCGGCCAAATCGGGAGGCGTCGACGT 
GCC 

CAGGGATTGTGGTTGTAAGCCITGCATATGTACAGTCCCAGAA 
GTA 

TCATCTGTCTTCATCTTCCCCCCAAAGCCCAAGGATGTGCTCAC 
CA 

TTACTCTGACTCCTAAGGTCACGTGTGTTGTGGTAGACATCAG 
CAA 

GGATGATCCCGAGGTCCAGTTCAGCTGGTTTGTAGATGATGTG 
GAG 
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GTGCACACAGCTCAGACGCAACCaCGGGAaGAGCAGTTCCAAA 
GC 

ACTTTCCGCTCAGTCAGTGAACTTCCCATCATGCACCAGGACT 
GGC. 

TCAATGGCAAGGAGTTCAAATGCAGGGTCAACAGTGCAGCTTT 
CC 

CTGCCCCCATCGAGAAAACCATCTCCAAAACCAAAGGCAGAC 
CGA 

AGGCTCCACAGGTGTACACCATTCCACCTCCCAAGGAGCAGAT 
GG 

CCAAGGATAAAGTCAGTCTGACCTGCATGATAACAGACTTCTT 
CCC 

TGAAGACATTACTGTGGAGTGGCAGTGGAATGGGCAGCCAGC 

GGA 

GAACTACAAGAACACTCAGCCCATCATGGACACAGATGGCTC 
TTA 

CTTCGTCTACAGCAAGCTCAATGTGCAGAAGAGCAACTGGGA 
GGC » 

AGGAAATACTTTCACCTGCTCTGTGTTACATGAGGGCCTQCAC 
AAC 

CACCATACTGAGAAGAGCCTCTCCCACTCTCCTGGTAAATGAT 
CCC 

AGTGTCCTTGGAGCCCTCTGGTCCTACAGCGGCCGCGAATTAA 
TTC 

GCCTTAGACATGACTGTTCCTCAGTTCAAGTTGGGCACTTACG 
AGA 

AGACCGGTCTTGCTAGATTCTAATCAAGAGGATGTCAGAATGC 
CAT 

TTGCCTGAGAGATGCAGGCTTCATTTTTGATACTTITITATTTG 
TAA 
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CCTATATAGTATA<K}ATTTTTTTTGT(^TTTTGTTTCTTCTCGTA 
CG 

AGCTTGCTCCTGATCAGCCTATCTCGCAGCTGATGAATATCTT 
GTG 

GTAGGGGTTTGGGAAAATCATTCGAGTTTGATGTTTTTCTTGGT 
AT 

TTCCCACTCCTCTTCAGAGTACAGAAGATTAAGTGAGAAGTTC 

GTT 

TGTGCAAGCTTATCGATAAGCTTTAATGCGGTAGTTTATCACA 
GTT 

AAATTGCTAACGCAGTCAGGCACCGTGTATGAAATCTAACAAT 
GC 

GCTCATCGTCATCCTCGGCACCGTCACCCTGGATGCTGTAGGC 
ATA 

GGCTTGGTTATGCCGGTACTGCCGGGCCTCTTGCGGGATATCG 
TCC 

ATTCCGACAGCATCGCCAGTCACTATGGCGTGCTGCTAGCGCT 
ATA 

TGCGTTGATGCAATTTCTATGCGCACCCGTTCTCGGAGCACTG 
TCC 

GACCGCTTTGGCCGCCGCCCAGTCCTGCTCGCTTCGCTACTTG 
GAG 

CCACTATCGACTACGCGATCATGGCGACCACACCCGTCCTGTG 

GAT 

CTATCGAATCTAAATGTAAGTTAAAATCTCTAAATAATTAAAT 
AAG 

TCCCAGTTTCTCCATACGAACCTTAACAGCATTGCGGTGAGCA 
TCT 

AGACCTTCAACAGCAGCCAGATCCATCACTGCTTGGCCAATAT 
GTT 
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TCAGTCCCTCAGGAGTTACGTCTTGTGAAGTGATGAACTTCTG 
GAA 

GGTTGCAGTGTTAACTCCGCTGTATTGACGGGCATATCCGTAC 
GTT 

5 GGCAAAGTGTGGTTGGTACCGGAGGAGTAATCTCCACAACTCT 

CT 

GGAGAGTAGGCACCAACAAACACAGATCCAGCGTGTTGTACT 
TGA 

TCAACATAAGAAGAAGCATTCTCGATTTGCAGGATCAAGTGTT 
10 CA 

GGAGCGTACTGATTGGACATTTCCAAAGCCTGCTCGTAGGTTG 
CAA 

CCGATAGGGTTGTAGAGTGTGCAATACACTTGCGTACAATTTC 
AAC 

15 CCTTGGCAACTGCACAGCTTGGTTGTGAACAGCATCTTCAATT 

CTG 

GCAAGCTCCTTGTCTGTCATATCGACAGCCAACAGAATCACCT 
GGG 

AATCAATACCATGTTCAGCTTGAGCAGAAGGTCTGAGGCAAC 
20 GAA 

ATCTGGATCAGCGTATTTATCAGCAATAACTAGAACTTCAGAA 
GGC 

CCAGCAGGCATGTCAATACTACACAGGGCTGATGTGTCATTTT 
GAA 

25 CCATCATCTTGGCAGCAGTAACGAACTGGTTTCCTGGACCAAA 

TAT 

TTTGTCACACTTAGGAACAGTTTCTGTTCCGTAAGCCATAGCA 
GOT 

ACTGCCrGGGCGCCTCCTGCTAGCACGATACACTTAGCACCAA 
30 CCT 
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TGTGGGCAACGTAGATGACTTCTGGGGTAAGGGTACCATCCTT 
CTT 

AGGTGGAGATGCAAAAACAATTTCTTTGCAACCAGCAACTTTG 
GC 

AGGAACACCCAGCATCAGGGAAGTGGAAGGCAGAATTGCGGT 
TCC 

ACCAGGAATATAGAGGCCAACTTTCTCAATAGGTCTTGCAAAA 

CG 

AGAGCAGACTACACCAGGGCAAGTCTCAACTTGCAACGTCTCC 
GT 

TAGTTGAGCTTCATGGAATTTCCTGACGTTATCTATAGAGAGA 
TCA 

ATGGCTCTCTTAACGTTATCTGGCAATTGCATAAGTTCCTCTGG 
GA 

AAGGAGCTTCTAACACAGGTGTCTTCAAAGCGACTCCATCAAA 
CTT 

GGCAGTTAGTTCTAAAAGGGCTTTGTCACCATrTTGACGAACA 
TTG 

TCGACAATTGGTTTGACTAATTCCATAATCTGTTCCGTTTTCTG 
GAT 

AGGACGACGAAGGGCATCTTCAATTTCTTGTGAGGAGGCCTTA 
GA 

AACGTCAATTTTGCACAATTCAATACGACCTTCAGAAGGGACT 
TCT 

TTAGGTTTGGATTCTTCTTTAGGTTGTTCCTTGGTGTATCCTGG 
CTT 

GGCATCTCCTTTCCTTCTAGTGACCTTTAGGGACTTCATATCCA 
GGT 

TTCTCTCCACCTCGTCCAACGTCACACCGTACTTGGCACATCTA 
AC 
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TAATGCAAAATAAAATAAGTCAGCACATTCCCAGGCTATATCT 
TCC 

TTGGATTTAGCTTCTGCAAGTTCATCAGCTTCCTCCCTAATTTT 
AGC 

GTTCAAACAAAACTTCGTCGTCAAATAACCGTTTGGTATAAGA 
ACC 

TTCTGGAGCATTGCTCTTACGATCCCACAAGGTGCTTCCATGG 

CTC 

TAAGACCCTTTGATTGGCCAAAACAGGAAGTGCGTTCCAAGTG 
AC 

AGAAACCAACACCTGTTTGTTCAACCACAAATTTCAAGCAGTC 
TCC 

ATCACAATCCAATTCGATACCCAGCAACTTTTGAGTTCGTCCA 
GAT 

GTAGCACCTTTATACCACAAACCGTGACGACGAGATTGGTAGA 
CT 

CCAGTTTGTGTCCTTATAGCCTCCGGAATAGACTTTTTGGACG 
AGT 

ACACCAGGCCCAACGAGTAATTAGAAGAGTCAGCCACCAAAG 
TAG 

TGAATAGACCATCGGGGCGGTCAGTAGTCAAAGACGCCAACA 
AAA 

TTTCACTGACAGGGAACTTTTTGACATCTTCAGAAAGTTCGTA 
TTC 

AGTAGTCAATTGCCGAGCATCAATAATGGGGATTATACCAGA 
AGC 

AACAGTGGAAGTCACATCTACCAACTTTGCGGTCTCAGAAAAA 
GC 

ATAAACAGTTCTACTACCGCCATTAGTGAAACTTTTCAAATCG 
CCC 
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AGTGGAGAAGAAAAAGGCACAGCGATACTAGCATTAGCGGGC 
AA 

GGATGCAACTTTATCAACCAGGGTCCTATAOATAACCCTAGCG 
OCT 

GGGATCATCCTTTGGACAACTCTTTCTGCCAAATCTAGGTCCA 
AAA 

TCACTTCATTGATACCATTATTGTACAACTTGAGCAAGTTGTCG 

AT 

CAGCTCCTCAAATTGGTCCTCTGTAACGGATGACTCAACTTGC 
ACA 

TTAACTTGAAGCTCAGTCGATTGAGTGAACTTGATCAGGTTGT 
GCA 

GCTGGTCAGCAGCATAGGOAAACACGGCTTTTCCTACCAAACT 
CA 

AGGAATTATCAAACTCTGCAACACTTGCGTATGCAGGTAGCAA 
GG 

GAAATGTCATACTTGAAGTCGGACAGTGAGTGTAGTCTTGAGA 
AA 

TTCTGAAGCCGTATTTTTATTATCAGTGAGTCAGTCATCAGGA 
GAT 

CCTCTACGCCGGACGCATCGTGGCCGACCTGCAGGTCGGCATC 
ACC 

GGCGCCACAGGTGCGGTTGCTGGCGCCTATATCGCCGACATCA 
CC 

GATGGGGAAGATCGGGCTCGCCACTTCGGGCTCATGAGCGCTT 
OTT 

TCGGCGTGGGTATGGTGGCAGGCCCCGTGGCCGGGGGACTGTT 
GG 

GCGCCATCTCCTTGGACCTGCAGGGGGGGGGGGGGAAAGCCA 
CGT 
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TGTGTCTCAAAATCTCTGATGTTACATTGCACAAGATAAAAAT 

ATA 

TCATCATGAACAATAAAACTGTCTGCTTACATAAACAGTAATA 
CAA 

5 GGGGTGTTATGAGCCATATTCAACGGGAAACGTCTTGCTCAAG 

GC 

CGCGATTAAATTCCAACATGGATGCTGATTTATATGGGTATAA 
ATG 

GGCTCGCGATAATGTCGGGCAATCAGGTGCGACAATCTATCGA 
10 TT 

GTATGGGAAGCCCGATGCGCCAGAGTTGTTTCTGAAACATGGC 
AA 

AGGTAGCGTTGCCAATGATGTTACAGATGAGATGGTCAGACTA 
AA 

15 ' CTGGCTGACGGAATrTATGCCTCTTCCGACCATCAAGCATTTT 

ATC 

CGTACTCCTGATGATGCATGGTTACTCACCACTGCGATCCCCG 
GGA 

AAACAGCATTCCAGGTATTAGAAGAATATCCTGATTCAGGTGA 
20 AA 

ATATTGTTGATGCGCTGGCAGTGTTCCTGCGCCGGTTGCATTC 
GAT 

TCCTGTTTGTAATTGTCCTTTTAACAGCGATCGCGTATTTCGTC 
TCG 

25 CTCAGGCGCAATCACGAATGAATAACGGTTTGGTTGATGCGAG 

TG 

ATTTTGATGACGAGCGTAATGGCTGGCCTGTTGAACAAGTCTG 
GAA 

AGAAATGCATAAGCTTTTGCCATTCTCACCGGATTCAGTCGTC 
30 ACT 
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CATCKjTGATTTCTCACTTGATAACCTTATTTTTGACGAGGGGA 
AAT 

TAATAGGTTGTATTGATGTTGGACGAGTCGGAATCGCAGACCO 
AT 

ACCAGGATCTTGCCATCCTATGGAACTGCCTCGGTGAGTTTTC 
TCC 

TTCATTACAGAAACGGCTTTTTCAAAAATATGGTATTGATAAT 
CCT 

GATATGAATAAATTGCAGTTTCATITGATGCTCGATGAGTTTTT 
CT 

AATCAGAATTGGTTAATTGGTTGTAACACTGGCAGAGCATTAC 
GOT 

GACTTGACGGGACGGCGGCTTTGTTGAATAAATCGAACTTTTG 
CTG 

AGTTGAAGGATCAGATCACGCATCTTCCCGACAACGCAGACC 
GTT 

CCGTGGCAAAGCAAAAGTTCAAAATCACCAACTGGTCCACCT 
ACA 

ACAAAGCTCTCATCAACCGTGGCTCCCTCACTTTCTGGCTGGA 
TGA 

TGGGGCGATTCAGGCCTGGTATGAGTCAGCAACACCTTCTTCA 
CGA 

GGCAGACCTCAGCGCCCCCCCCCCCCTGCAGGTCCCACGGCGG 
CG 

GTGCTCAACGGCCTCAACCTACTACTGGGCTGCTTCCTAATGC 
AGG 

AGTCGCATAAGGGAGAGCGTCGAGTATCTATGATTGGAAGTA 
TGG 

GAATGGTGATACCCGCATTCTTCAGTGTCTTGAGGTCTCCTATC 
AG 
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ATTATGCCCAACTAAAGCAACCGGAGGAGGAGATTTCATGGT 
AAA 

TTTCTCTGACTTTTGGTCATCAGTAGACTCGAACTGTGAGACTA 
TCT 

CGGTTATGACAGCAGAAATGTCCTTCTTGGAGACAGTAAATGA 
AG 

TCCCACCAATAAAGAAATCCTTGTTATCAGGAACAAACTTCTT 

GTT 

TCGAACTTTTTCGGTGCCTTGAACTATAAAATGTAGAGTGGAT 
ATG 

TCGGGTAGGAATGGAGCGGGCAAATGCTTACCTTCTGGACCTT 
CA 

AGAGGTATGTAGGGTTTGTAGATACTGATGCCAACTTCAGTGA 
CA 

ACGTTGCTATTTCGTTCAAACCATTCCGAATCCAGAGAAATCA 
AAG 

TTGTTTGTCTACTATTGATCCAAGCCAGTGCGGTCTTGAAACTG 
AC 

AATAGTGTGCTCGTGTTTTGAGGTCATCTTTGTATGAATAAATC 
TA 

GTCTTTGATCTAAATAATCTTGACGAGCCAAGGCGATAAATAC 
CCA 

AATCTAAAACTCTTTTAAAACGTTAAAAGGACAAGTATGTCTG 

CCT 

GTATTAAACCCCAAATCAGCTCGTAGTCTGATCCTCATCAACT 
TGA 

GGGGCACTATCTTGTTTTAGAGAAATTTGCGGAGATGCGATAT 
CGA 

GAAAAAGGTACGCTGATTTTAAACGTGAAATTTATCTCAAGAT 
CTC 
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TGCCTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACA 
TGC 

AGCTCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGG 
GA 

GCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGT 
GTC 

GGGGCGCAGCCATGACCCAGTCACGTAGCGATAGCGGAGTGT 
ATA 

CTGGCTTAACTATGCGGCATCAQAGCAGATTGTACTGAGAGTG 
CA 

CCATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAA 
ATA 

CCGCATCAGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGC 
GC 

TCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGG 
CG 

GTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAG 
AAC 

ATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAA 
GGC 

CGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAG 
CAT 

CACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACA 
GGA 

CTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGC 
GOT 

CTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTT 
CTC 

CCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGT 
ATC 
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TCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCA 
CGA 

ACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTAT 
CGT 

CTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAG 
CA 

GCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGT 

GCT 

ACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAA 
GG 

ACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCG 
GAA 

AAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTG 
GTA 

GCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAA 
AAA 

AGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGAC 
GCT 

CAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGAT 
TA 

TCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTT 

TTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTA 

CCAATG<mAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTT 

CGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGA 

TACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGC 
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GAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGC 

CAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCG 

CCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAG 

TTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGC 

ATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCG 

GTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAA 

AAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAG 

TTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATT 

CTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGA 

GTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAG 

TTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGC 

AGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGA 

AAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAAC 

CCACTCGTGCACCCAACTGATCITCAGCATCrTTTACTrTCACCAGC 

GTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAA 
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GGGAATAAGGGCGACACGGAAATGTTGAATACrCATACTCTTCCTT 

TTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCG 

GATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTC 

CGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCA 

TTATTATCATGACATTAACCTATAAAAATAGGCGTATCACGAGGCC 

CTTTCGTCTTCAAGAATTAATTCTCATGTTTGACAGCTTATCATCGA 

TAAGCTGACTCATGTTGGTATTGTGAAATAGACGCAGATCGGGAA 
CACTGAAAAATAACAGTTATTATTCGAGATC 

pUB 115 

GATCTAACATCCAAAGACGAAAGGTTGAATGAAACCTTTTTGCCAT 

CCGACATCCACAGGTCCATTCTCACACATAAGTGCCAAACGCAAC 
SEQID 

AGGAGGGGATACACTAGCAGCAGACCGTTGCAAACGCAGGACCTC 

N0:33 

CACTCCTCTTCTCCTCAACACCCACTTTTGCCATCGAAAAACCAGC 

CCAGTTATTGGGCnrrGATTGGAGCTCGCTCATTCCAATTCCTTCTAT 

TAGGCTACTAACACCATGACTTTATTAGCCTGTCTATCCTGGCCCC 

CCTGGCGAGGTTCATGTTTGTTTATTTCCGAATGCAACAAGCTCCG 
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CATTACACCCGAACATCACTCCAGATGAGGGCTTTCTGAGTGTGGG 

GTCAAATAGTTTCATGTTCCCCAAATGGCCCAAAACTGACAGTTTA 

AACGCTGTCTTGGAACCTAATATGACAAAAGCGTGATCTCATCCAA 

GATGAACTAAGTTTGGTTCGTTGAAATGCTAACGGCCAGTTGGTCA 

AAAAGAAACTTCCAAAAGTCGCCATACCGTTTGTCTTGTTTGGTAT 

TGATTGACGAATGCTCAAAAATAATCTCATTAATGCTTAGCGCAGT 

CTCTCTATCGCTTCTGAACCCCGGTGCACCTGTGCCGAAACGCAAA 

TGGGGAAACACCCGCTTTTTGGATGATTATGCATTGTCTCCACATT 

GTATGCTTCCAAGATTCTGGTGGGAATACTGCTGATAGCCTAACGT 

TCATGATCAAAATTTAACTGTTCTAACCCCTACTTGACAGCAATAT 

ATAAACAGAAGGAAGCTGCCCn'GTCTTAAACCTTTITITTTATCAT 

CATTATTAGCTTACTTTCATAATTGCGACTGGTTCCAATTGACA 

AG 

CTTTTGATTTTAACGACTITTAACGACAACTTGAGAAGATCAA 
AAA 

ACAACTAATTATTCGAAGGATCCAAACGATGAGATTTCCTTCA 
ATT 

TTTACTGCAGTTTTATTCGCAGCATCCTCCGCATTAGCTGCTCC 
AGT 
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CAACACTACAACAGAAGATGAAACGGCACAAATTCCGGCTGA 
AGC 

TGTCATCGGTTACTCAGATTTAGAAGGGGATTTCGATGTTGCT 
GTT 

TTGCCATTTTCCAACAGCACAAATAACGGGTTATTGTTTATAA 
ATA 

CTACTATTGCCAGCATTGCTGCTAAAGAAGAAGGGGTATCTCT 
CGA 

GAAAAGATGCGGACCGGGCAGGGGGTTCGGGAAGAGGAGGC 
ACC 

CCAAAAAGCTGACCCCnTAGCCTACAAGCAGTTTATCCCCAA 
TGT 

GGCCGAGAAGACCCTAGGCGCCAGCGGAAGGTATGAAGGGAA 

OA 

TCTCCAGAAACTCCGAGCGATTTAAGGAACTCACCCCCAATTA 
CAA 

CCCCGACATCATATTTAAGGATGAAGAAAACACCGGAGCGGA 
CAG 

GCTGATGACTCAGAGGTGTAAGGACAAGTTGAACGCTTTGGCC 
AT 

CTCGGTGATGAACCAGTGGCCAGGAGTGAAACTGCGGGTGAC 
CGA 

GGGCTGGGACGAAGATGGCCACCACTCAGAGGAGTCTCTGCA 

CTA 

CGAGGGCCGCGCAGTGGACATCACCACGTCTGACCGCGACCG 
CAG 

CAAGTACGGCATGCTGGCCCGCCTGGCGGTGGAGGCCGGCTTC 
GA 

CTGGGTGTACTACGAGTCCAAGGCACATATCCACTGCTCGGTG 
AA 
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AGCAGAGAACTCGGTGGCGGCCAAATCGGGAGGCGTCGACCC 
CAG 

AGGGCCCACAATCAAGCCCTGTCCTCCATGCAAATGCCCAGCA 
CCT 

AACCTCTTGGGTGGACCATCCGTCTTCATCTTCCCTCCAAAGAT 
CA 

AGGATGTACTCATGATCTCCCTGAGCCCCATAGTCACATGTGT 

GGT 

GGTGGATGTGAGCGAGGATGACCCAGATGTCCAGATCAGCTG 
GTT 

TGTGAACAACGTGGAAGTACACACAGCTCAGACACAAACCCA 
TAG 

AGAGGATTACCAAAGTACaCTtCGGGTGGTCAGTGCCCTCCCCA 
TCC 

AGCACCAGGACTGGATGAGTGGCAAGGAGTTCAAATGCAAGG 
TCA 

ACAACAAAGACCTCCCAGCGCCCATCGAGAGAACCATCTCAA 
AAC 

CCAAAGGGTCAGTAAGAGCTCCACAGGTATATGTCTTGCCTCC 
ACC 

AGAAGAAGAGATGACTAAGAAACAGGTCACTCTGACCTGCAT 
GGT 

GACAGACTTCATGCCTGAAGACATTTACGTGGAGTGGACCAAC 
AA 

CGGGAAAACAGAGCTAAACTACAAGAACACTGAACCAGTCCT 
GGA 

CTCTGATGGTTCTTACTTCATGTACAGCAAGCTGAGAGTGGAA 
AAG 

AAGAACTGGGTGGAAAGAAATAGCTACTCCTGTTCAGTGGTCC 
AC 
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GAGGGTCTGCACAATCACCACACGACTAAGAGCTTCTCCCGGA 
CT 

CCGGGTAAATGAGCTCAGATCGATTCCATGGATCCTCACATCC 
CAA 

TCCGCGGCCGCGAATTAATTCGCCTTAGACATGACTGTTCCTC 
AGT 

TCAAGTTGGGCACTTACGAGAAGACCGGTCTTGCTAGATTCTA 
ATC 

AAGAGGATGTCAGAATGCCATTTGCCTGAGAGATGCAGGCTTC 
AT 

TTTTGATACTTTTTTATTTGTAACCTATATAGTATAGGATTTTTT 
TTG 

TCATTTrGTITCTrCTCGTACGAGCTTGCTCCTGATCAGCCTAT 
CTC 

GCAGCTGATGAATATCTTGTGGTAGGGGTTTGGGAAAATCATT 
CGA 

GTITGATGTTTTTCTTGGTATTTCCCACTCCTCTTCAGAGTACA 
GAA 

GATTAAGTGAGAAGTTCGTTTGTGCAAGCTTATCGATAAGCTT 
TAA 

TGCGGTAGTTTATCACAGTTAAATTGCTAACGCAGTCAGGCAC 
CGT 

GTATGAAATCTAACAATGCGCTCATCGTCATCCTCGGCACCGT 
CAC 

CCTGGATGCTGTAGGCATAGGCTTGGTTATGCCGGTACTGCCG 
GGC 

CTCTTGCGGGATATCGTCCATTCCGACAGCATCGCCAGTCACT 
ATG 

GCGTGCTGCTAGCGCTATATGCGTTGATGCAATTTCTATGCGC 
ACC 
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CGTTCTCGGAGCACTGTCCGACCGCTTTGGCCGCCGCCCAGTC 

CTG 

CTCGCTTCGCTACTTGGAGCCACTATCGACTACGCGATCATGG 
CGA 

CCACACCCGTCCTGTGGATCTATCGAATCTAAATGTAAGTTAA 
AAT 

CTCTAAATAATTAAATAAGTCCCAGTTTCTCCATACGAACCTT 
AAC 

AGCATTGCGGTGAGCATCTAGACCTTCAACAGCAGCCAGATCC 
AT 

CACTGCTTGGCCAATATGTTTCAGTCCCTCAGGAGTTACGTCTT 
GT 

GAAGTGATGAACTTCTGGAAOGTTGCAGTOTTAACTCCGCTGT 
ATT 

GACGGGCATATCCGTACGTTGGCAAAGTGTGGTTGGTACCGGA 
GG 

AGTAATCTCCACAACTCTCTGGAGAGTAGGCACCAACAAACA 
GAG 

ATCCAGCGTGTTGTACTTGATCAACATAAGAAGAAGCATTCTC 
GAT 

TTGCAGGATCAAGTGTTCAGGAGCGTACTGATTGGACATTTCC 
AAA 

GCCTGCTCGTAGGTTGCAACCGATAGGGTTGTAGAGTGTGCAA 
TAG 

ACTTGCGTACAATTTCAACCCTTGGCAACTGCACAGCTTGGTT 
GTG 

AACAGCATCTTCAATTCTGGCAAGCTCCTTGTCTGTCATATCG 
ACA 

GCCAACAGAATCACCTGGGAATCAATACCATGTTCAGCTTGAG 
CA 

-173- 



PCT/USOl/19435 



GAAGGTCTGAGGCAACGAAATCTGGATCAGCGTATTTATCAGC 
AA 

TAACTAGAACTTCAGAAGGCCCAGCAGGCATGTCAATACTAC 
ACA 

GGGCTGATGTGTCATTTTGAACCATCATCTTGGCAGCAGTAAC 
GAA 

CTGGTTTCCTGGACCAAATATTTTGTCACACTTAGGAACAGTTr 
CT 

GTTCCGTAAGCCATAGCAGCTACTGCCTGGGCGCCTCCTGCTA 
GCA 

CGATACACTTAGCACCAACCTTGTGGGCAACGTAGATGACTTC 
TGG 

GGTAAGGGTACCATCCTTCTTAGGTGGAGATGCAAAAACAATT 
TCT 

TTGCAACCAGCAACTTTGGCAGGAACACCCAGCATCAGGGAA 
GTG 

GAAGGCAGAATTGCGGTTCCACCAGGAATATAGAGGCCAACT 
TTC 

TCAATAGGTCTTGCAAAACGAGAGCAGACTACACCAGGGCAA 
GTC 

TCAACTTGCAACGTCTCCGTTAGTTGAGCTTCATGGAATTTCCT 
GA 

CGTTATCTATAGAGAGATCAATGGCTCTCTTAACGTTATCTGG 
CAA 

TTGCATAAGTTCCTCTGGGAAAGGAGCTTCTAACACAGGTGTC 
TTC 

AAAGCGACTCCATCAAACTTGGCAGTTAGITCTAAAAGGGCTT 
TGT 

CACCATTTTGACGAACATTGTCGACAATTGGTTTGACTAATTC 
CAT 
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AATCTGTTCCGTTTTCTGGATAGGACGACGAAGGGCATCTTCA 
ATT 

TCTTGTGAGGAGGCCTTAGAAACGTCAATTTTGCACAATTCAA 
TAG 

GACCTTCAGAAGGGACTTCTTTAGGTTTGGATTCTTCTTTAGGT 
TOT 

TCCTTGGTGTATCCTGGCTTGGCATCTCCTTTCCTTCTAGTGAC 
CTT 

TAGGGACTTCATATCCAGGTTTCTCTCCACCTCGTCCAACGTCA 
CA 

CCGTACTTGGCACATCTAACTAATGCAAAATAAAATAAGTCAG 
CA 

CATTCCCAGGCTATATCTTCCTTGGATTTAGCTTCTGCAAGTTC 
ATC 

AGCTTCCTCCCTAATTTTAGCGTTCAAACAAAACTTCGTCGTCA 
AA 

TAACCGTTTGGTATAAGAACCTTCTGGAGCATTGCTCTTACGA 
TCC 

CACAAGGTGCTTCCATGGCTCTAAGACCCTTTGATTGGCCAAA 
ACA 

GGAAGTGCGTTCCAAGTGACAGAAACCAACACCTGTTTGTTCA 
AC 

CACAAATTTCAAGCAGTCTCCATCACAATCCAATTCGATACCC 
AGC 

AACTTTTGAGTTCGTCCAGATGTAGCACCTTTATACCACAAAC 
CGT 

GACGACGAGATTGGTAGACTCCAGTTTGTGTCCTTATAGCCTC 
CGG 

AATAGACTTTTTGGACGAGTACACCAGGCCCAACGAGTAATTA 
GA 
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AGAGTCAGCCACCAAAGTAGTGAATAGACCATCGGGGCGGTC 
AGT 

AGTCAAAGACGCCAACAAAATTTCACTGACAGGGAACTTTTTG 
AC 

ATCTTCAGAAAGTTCGTATTCAGTAGTCAATTGCCGAGCATCA 
ATA 

ATGGGGATTATACCAGAAGCAACAGTGGAAGTCACATCTACC 
AAC 

TTTGCGGTCTCAGAAAAAGCATAAACAGTTCTACTACCGCCAT 
TAG 

TGAAACTTTTCAAATCGCCCAGTGGAGAAGAAAAAGGCACAG 
CGA 

TACTAGCATTAGCGGGCAAGGATGCAACTTTATCAACCAGGGT 
CCT 

ATAGATAACCCTAGCGCCTGGGATCATCCTTTGGACAACTCTT 
TCT 

GCCAAATCTAGGTCCAAAATCACTTCATTGATACCATTATTGT 
ACA 

ACTTGAGCAAGTTGTCGATCAGCTCCTCAAATTGGTCCTCTGT 
AAC 

GGATGACTCAACTTGCACATTAACTTGAAGCTCAGTCGATTGA 
GTG 

AACTTGATCAGGTTGTGCAGCTGGTCAGCAGCATAGGGAAAC 

ACGGC 

TTTTCCTACCAAACTCAAGGAATTATCAAACTCTGCAACACTT 
G 

CGTATGCAGGTAGCAAGGGAAATGTCATACTTGAAGTCGGAC 
AGT 

GAGTGTAGTCTTGAGAAATTCTGAAGCCGTATTTTTATTATCA 
GTG 
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AGTCAGTCATCAQGAGATCCTCTACGCCGGACGCATCGTGGCC 
GA 

CCTGCAGGTCGGCATCACCGGCGCCACAGGTGCGGTTGCTGGC 
GC 

5 CTATATCGCCGACATCACCGATGGGGAAGATCGGGCTCGCCAC 

TTC 

GGGCTCATGAGCGCTTGTTTCGGCGTGGGTATGGTGGCAGGCC 

CCG 

TGGCCGGGGGACTGTTGGGCGCCATCTCCTTGGACCTGCAGGG 
10 GG 

GGGGGGGGAAAGCCACGTTGTGTCTCAAAATCTCTGATGTTAC 
ATT 

GCACAAGATAAAAATATATCATCATGAACAATAAAACTGTCT 
GCT 

15 TACATAAACAGTAATACAAGGGGTGTTATGAGCCATATTCAAC 

GG 

GAAACGTCTTGCTCAAGGCCGCGATTAAATTCCAACATGGATG 
CTG 

ATTTATATGGGTATAAATGGGCTCGCGATAATGTCGGGCAATC 
20 AG 

GTGCGACAATCTATCGATTGTATGGGAAGCCCGATGCGCCAGA 
GT 

TGTTTCTGAAACATGGCAAAGGTAGCGTTGCCAATGATGTTAC 
AGA 

25 TGAGATGGTCAGACTAAACTGGCTGACGGAATTTATGCCTCTT 

CCG 

ACCATCAAGCATTTTATCCGTACTCCTGATGATGCATGGTTACT 
CA 

CCACTGCGATCCCCGGGAAAACAGCATTCCAGGTATTAGAAG 
30 AAT 
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ATCCTGATTCAGGTGAAAATATTGTTGATGCGCTGGCAGTGTT 
CCT 

GCGCCGGTTGCATTCGATTCCTGTTTGTAATTGTCCTTTTAACA 
GCG 

5 ATCGCGTATTTCGTCTCGCTCAGGCGCAATCACGAATGAATAA 

CGG 

TTTGGTTGATGCGAGTGATTTTGATGACGAGCGTAATGGCTGG 
CCT 

GTTGAACAAGTCTGGAAAGAAATGCATAAGCrnTGCCATTCT 
10 CAC 

CGGATTCAGTCGTCACTCATGGTGATTTCTCACTTGATAACCTT 
ATT 

TrTGACGAGGGGAAATTAATAGGTTGTATTGATGTTGGACGAG 
TCG 

15 GAATCGCAGACCGATACCAGGATCTTGCCATCCTATGGAACTG 

CCT 

CGGTGAGTirrCTCCTTCATrACAGAAACGGCTTTTTCAAAAAT 
AT 

GGTATTGATAATCCTGATATGAATAAATTGCAGTTTCATTTGA 
20 TGC 

TCGATGAGTTTTTCTAATCAGAATTGGTTAATTGGTTGTAACAC 
TG 

GCAGAGCATTACGCTGACTTGACGGGACGGCGGCTTTGTTGAA 
TA 

25 AATCGAACTTTTGCTGAGTTGAAGGATCAGATCACGCATCTTC 

CCG 

ACAACGCAGACCGTTCCGTGGCAAAGCAAAAGTTCAAAATCA 
CCA 

ACTGGTCCACCTACAACAAAGCTCTCATCAACCGTGGCTCCCT 
30 CAC 
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TTTCTGGCTGGATGATGGGGCGATTCAGGCCTGGTATGAGTCA 

GCA 

ACACCTTCTTCACGAGGCAGACCTCAGCGCCCCCCCCCCCCTG 
CAG 

5 GTCCCACGGCGGCGGTGCTCAACGGCCTCAACCTACTACTGGG 

CTG 

CTTCCTAATGCAGGAGTCGCATAAGGGAGAGCGTCGAGTATCT 
AT 

GATTGGAAGTATGGGAATGGTGATACCCGCATTCTTCAGTGTC 

10 TTG 

AGGTCTCCTATCAGATTATGCCCAACTAAAGCAACCGGAGGA 

GGA 

GATTTCATGGTAAATTTCTCTGACTTTTGGTCATCAGTAGACTC 

GA 

15 ACTGTGAGACTATCTCGGTTATGACAGCAGAAATGTCCTTCTT 

GGA 

GACAGTAAATGAAGTCCCACCAATAAAGAAATCCTTGTTATCA 
GG 

AACAAACTTCTTGTTTCGAACITITTCGGTGCCTTGAACTATAA 
20 AA 

TGTAGAGTGGATATGTCGGGTAGGAATGGAGCGGGCAAATGC 
TTA 

CCTTCTGGACCTTCAAOAGGTATGTAGGGTTTGTAGATACTGA 
TGC 

25 CAACTTCAGTGACAACGTTGCTATTTCGTTCAAACCATTCCGA 

ATC 

CAGAGAAATCAAAGTTGTTTGTCTACTATTGATCCAAGCCAGT 
GCG 

GTCTTGAAACTGACAATAGTGTGCTCGTGTTTTGAGGTCATCTT 
30 TG 
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TATGAATAAATCTAGTCTTTGATCTAAATAATCTTGACGAGCC 
AAG 

GCGATAAATACCCAAATCTAAAACTCTTTTAAAACGTTAAAAG 
GA 

5 CAAGTATGTCTGCCTGTATTAAACCCCAAATCAGCTCGTAGTC 

TGA 

TCCTCATCAACTTGAGGGGCACTATCTTGTTTTAGAGAAATTT 

GCG 

GAGATGCGATATCGAGAAAAAGGTACGCTGATTTTAAACGTG 
10 AAA 

TTTATCTCAAGATCTCTGCCTCGCGCGTTTCGGTGATGACGGTG 
AA 

AACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTC 
TGT 

15 AAGCGGATGCCGGGAGCAGACAAGCCCGTCAGGGCGCGTCAG 

CG 

GGTGTTGGCGGGTGTCGGGGCGCAGCCATGACCCAGTCACGT 
AGC 

GATAGCGGAGTGTATACTGGCTTAACTATGCGGCATCAGAGCA 
20 GA 

TTGTACTGAGAGTGCACCATATGCGGTGTGAAATACCGCACAG 
AT 

GCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCCGCTTCCTC 
GCT 

25 CACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTA 

TCA 

GCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGG 
GAT 

AACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGC 

30 CAG 
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GAACCGTAAAAAGGCCGCGTTGCTGGCGTmrCCATAGGCTC 

CGC 

CCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGG 
TGG 

CGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTG 
GA 

AGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGG 
AT 

ACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAA 
TG 

CTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCC 
AAG 

CTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCG 

CCT 

TATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGA 
CTT 

ATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGC 
GAG 

GTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAAC 
TAG 

GGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGA 
AGC 

CAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAA 
AC 

AAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCA 
GAT 

TACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTT 
TCT 

ACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGG 
ATT 
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TTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAA 

ATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAAC 

TTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCA 

GCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGT 

GTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGC 

TGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCA 

GCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCT 

GCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAG 

CTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGC 

CATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCT 

TCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCC 

CCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGT 

TGTCAGAAGTAAGITGGCCGCAGTGTTATCACTCATGGTTATGGCA 

GCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTC 

TGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATG 
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CGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACC 

GCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGT 

TCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCA 

GTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTT 

TACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAA 

TGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACT 

CATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATT 

GTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACA 

AATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTC 

TAAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGTA 

TCACGAGGCCCTTTCGTCTTCAAGAATTAATTCTCATGTTTGACAG 

CTTATCATCGATAAGCTGACTCATGTTGGTATTGTGAAATAGACGC 
AGATCGGGAACACTGAAAAATAACAGTTATTATTCGAGATC 

pUBiie 

GATCTAACATCCAAAGACGAAAGGTTGAATGAAACCrTTTTGCCAT 

CCGACATCCACAGGTCCATTCTCACACATAAGTGCCAAACGCAAC 
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AGGAGGGGATACACTAGCAGCAGACCGTTGCAAACGCAGGACCTC 

N0:34 

CACTCCTCTTCTCCTCAACACCCACTTTTGCCATCGAAAAACCAGC 

s 

CCAGTTATTGGGCTTGATTGGAGCTCGCTCATTCCAATTCCTTCTAT 
TAGGCTACTAACACCATGACTTTATTAGCCTGTCTATCCTGGCCCC 
10 CCTGGCGAGGTTCATGTTTGTTTATTTCCGAATGCAACAAGCTCCG 
CATTACACCCGAACATCACTCCAGATGAGGGCTTTCTGAGTGTGGG 
GTCAAATAGTTTCATGTTCCCCAAATGGCCCAAAACTGACAGTTTA 

IS 

AACGCTGTCTTGGAACCTAATATGACAAAAGCGTGATCTCATCCAA 
GATGAACTAAGTTTGGTTCGTTGAAATGCTAACGGCCAGTTGGTCA 
20 AAAAGAAACnrCCAAAAGTCGCCATACCGTTTGTCTTGTTTGGTAT 
TGATTGACGAATGCTCAAAAATAATCTCATTAATGCTTAGCGCAGT 
CTCTCTATCGCTTCTGAACCCCGGTGCACCTGTGCCGAAACGCAAA 

25 

TGGGGAAACACCCGCnTTTTGGATGATTATGCATTGTCTCCACATT 

GTATGCTTCCAAGATTCTGGTGGGAATACTGCTGATAGCCTAACGT 

30 TCATGATCAAAATTTAACTGTTCTAACCCCTACTTGACAGCAATAT 
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ATAAACAGAAGGAAGCTGCCCTGTCTTAAACCTTTTTTTTTATCAT 

CATTATTAGCTTACTTTCATAATTGCGACTGGTTCCAATTGACA 
AG 

CTTTTGATTTTAACGACTTTTAACGACAACTTGAGAAGATCAA 
AAA 

ACAACTAATTATTCGAAGGATCCAAACGATGAGATTTCCTTCA 
ATT 

TTTACTGCAGTTTTATTCGCAGCATCCTCCGCATTAGCTGCTCC 
AGT 

CAACACTACAACAGAAGATGAAACGGCACAAATTCCGGCTGA 
AGC 

TGTCATCGGTTACTCAGATTTAGAAGGGGATTTCGATGTTGCT 
GTT 

TTGCCATTTTCCAACAGCACAAATAACGGGTTATTGTTTATAA 
ATA 

CTACTATTGCCAGCATTGCTGCTAAAGAAGAAGGGGTATCTCT 
CGA 

GAAAAGATGCGGACCGGGCAGGGGGTTCGGGAAGAGGAGGC 
ACC 

CCAAAAAGCTGACCCCTTTAGCCTACAAGCAGTTTATCCCCAA 
TGT 

GGCCGAGAAGACCCTAGGCGCCAGCGGAAGGTATGAAGGGAA 
GA 

TCTCCAGAAACTCCGAGCGATTTAAGGAACTCACCCCCAATTA 
CAA 

CCCCGACATCATATTTAAGGATGAAGAAAACACCGGAGCGGA 
CAG 

GCTGATGACTCAGAGGTGTAAGGACAAGTTGAACGCTTTGGCC 
AT 
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CTCGGTGATGAACCAGTGGCCAGGAGTGAAACTGCGGGTGAC 

CGA 

GGGCTGGGACGAAGATGGCCACCACTCAGAGGAGTCTCTGCA 
CTA 

CGAGGGCCGCGCAGTGGACATCACCACGTCTGACCGCGACCG 
GAG 

CAAGTACGGCATGCTGGCCCGCCTGGCGGTGGAGGCCGGCTTC 
GA 

CTGGGTGTACTACGAGTCCAAGGCACATATCCACTGCTCGGTG 
AA 

AGCAGAGAACTCGGTGGCGGCCAAATCGGGAGGCGTCGACAA 
AA 

CTCACACATGCCCACCGTGCCCAGCACCTGAACTCCTGGGGGG 
AC 

CGTCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCAT 
GAT 

CTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGC 
CA 

CGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGT 
GGA 

GGTGCATAATGCCAAGACAAAGCCGcgggaggagcagtaccagagcacgta 
ccg^ 

tggTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGC 
AAG 

GAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCC 
ATC 

GAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCA 
CA 

GGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAAC 
CA 
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GGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGAC 
ATC 

GCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTAC 
AA 

5 GACCACGCCTCCCGTGTTGGACTCCGACGGCTCCTTCTTCCTCT 

AC 

AGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAAC 
GT 

CTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTAC 
10 ACG 

CAGAAGAGCCTCTCCCTGTCTCCCGGGAAATGAGTGCGGCGGC 
CG 

CGiy^TTAATTCGCCTTAGACATGACTGTTCCTCAGTTCAAGTT 
GGG 

15 CACTTACGAGAAGACCGGTCTTGCTAGATTCTAATCAAGAGGA 

TGT 

CAGAATGCCATTTGCCTGAGAGATGCAGGCTTCATTTTTGATA 
CTT 

TTTTATTTGTAACCTATATAGTATAGGATTTTTTTTGTCATTTTG 
20 TTT 

CTTCTCGTACGAGCTTGCTCCTGATCAGCCTATCTCGCAGCTGA 
TG 

AATATCTTGTGGTAGGGGTTTGGGAAAATCATTCGAGTTTGAT 
GTT 

25 TTTCTTGGTATTTCCCACTCCTCTTCAGAGTACAGAAGATTAAG 

TG 

AGAAGTTCGTTTGTGCAAGCTTATCGATAAGCTTTAATGCGGT 
AGT 

TTATCACAGTTAAATTGCTAACGCAGTCAGGCACCGTGTATGA 
30 AAT 
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CTAACAATGCGCTCATCGTCATCCTCGGCACCGTCACCCTGGA 
TGC 

TGTAGGCATAGGCTTGGTTATGCCGGTACTGCCGGGCCTCTTG 
CGG 

5 GATATCGTCCATTCCGACAGCATCGCCAGTCACTATGGCGTGC 

TGC 

TAGCGCTATATGCGTTGATGCAATTTCTATGCGCACCCGTTCTC 

GG 

AGCACTGTCCGACCGCTTTGGCCGCCGCCCAGTCCTGCTCGCT 
10 TCG 

CTACTTGGAGCCACTATCGACTACGCGATCATGGCGACCACAC 
CCG 

TCCTGTGGATCTATCGAATCTAAATGTAAGTTAAAATCTCTAA 
ATA 

15 ATTAAATAAGTCCCAGTTTCTCCATACGAACCTTAACAGCATT 

GCG 

GTGAGCATCTAGACCTTCAACAGCAGCCAGATCCATCACTGCT 
TGG 

CCAATATGTTTCAGTCCCTCAGGAGTTACGTCTTGTGAAGTGA 
20 TGA 

acttctggaaggttgcagtgttaactccgctGtattgacgggc 

ATA 

tccgtacgttggcaaagtgtggttggtaccggaggagtaatct 

CCA 

25 CAACTCTCTGGAGAGTAGGCACCAACAAACACAGATCCAGCG 

TGT 

TGTACTTGATCAACATAAGAAGAAGCATTCTCGATTTGCAGGA 
TCA 

AGTGTTCAGGAGCGTACTGATTGGACATTTCCAAAGCCTGCTC 

30 GTA 
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GGTTGCAACCGATAGGGTTGTAGAGTGTGCAATACACTTGCGT 
AC 

AATTTCAACCCTTGGCAACTGCACAGCTTGGTTGTGAACAGCA 
TCT 

TCAATTCTGGCAAGCTCCTTGTCTGTCATATCGACAGCCAACA 
GAA 

TCACCTGGGAATCAATACCATGTTCAGCTTGAGCAGAAGGTCT 
GA 

GGCAACGAAATCTGGATCAGCGTATTTATCAGCAATAACTAGA 
AC 

TTCAGAAGGCCCAGCAGGCATGTCAATACTACACAGGGCTGA 
TGT 

GTCATTTTGAACCATCATCTTGGCAGCAGTAACGAACTGGTTT 
CCT 

GGACCAAATAITTTGTCACACTTAGGAACAGTTTCTGTTCCGT 
AAG 

CCATAGCAGCTACTGCCTGGGCGCCTCCTGCTAGCACGATACA 
CTT 

AGCACCAACCTTGTGGGCAACGTAGATGACTTCTGGGGTAAG 
GGT 

ACCATCCTTCTTAGGTGGAGATGCAAAAACAATTTCTTTGCAA 
CCA 

GCAACTTTGGCAGGAACACCCAGCATCAGGGAAGTGGAAGGC 
AGA 

ATTGCGGTTCCACCAGGAATATAGAGGCCAACTTTCTCAATAG 
GTC 

TTGCAAAACGAGAGCAGACTACACCAGGGCAAGTCTCAACTT 
GCA 

ACGTCTCCGTrAGTTGAGCTTCATGGAATTTCCTGACGTTATCT 
ATA 
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GAGAGATCAATQGCTCTCTTAACGTTATCTGGCAATTGCATAA 
GTT 

CCTCTGGGAAAGGAGCTTCTAACACAGGTGTCTTCAAAGCGAC 
TCC 

5 ATCAAACTTGGCAGTTAGTTCTAAAAGGGCTTTGTCACCATTT 

TGA 

CGAACATTGTCGACAATTGGTTTGACTAATTCCATAATCTGTTC 

CG 

TTTTCTGGATAGGACGACGAAGGGCATCTTCAATTTCTTGTGA 
10 GGA 

GGCCTTAGAAACGTCAATTTTGCACAATTCAATACGACCTTCA 
GAA 

GGGACTTCTTTAGGTTTGGATTCTTCTTTAGGTTGTTCCTTGGT 
GTA 

15 TCCTGGCTTGGCATCTCCTTTCCTTCTAGTGACCTTTAGGGACT 

TCA 

TATCCAGGTTTCTCTCCACCTCGTCCAACGTCACACCGTACTTG 
GC 

ACATCTAACTAATGCAAAATAAAATAAGTCAGCACATTCCCAG 
20 GC 

TATATCTTCCTTGGATTTAGCTTCTGCAAGTTCATCAGCTTCCT • 
CCC 

TAATTTTAGCGTTCAAACAAAACTTCGTCGTCAAATAACCGTT 
TGG 

25 TATAAGAACCTTCTGGAGCATTGCTCTTACGATCCCACAAGGT 

GCT 

TCCATGGCTCTAAGACCCnTGATTGGCCAAAACAGGAAGTGC 
GTT 

CCAAGTGACAGAAACCAACACCTGTTTGTTCAACCACAAATTT 
30 CAA 
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GCAGTCTCCATCACAATCCAATTCGATACCCAGCAACTTTTGA 

GTT 

CGTCCAGATGTAGCACCTTTATACCACAAACCGTGACGACGAG 
ATT 

GGTAGACTCCAGTTTGTGTCCTTATAGCCTCCGGAATAGACTT 
TTT 

GGACGAGTACACCAGGCCCAACGAGTAATTAGAAGAGTCAGC 

CAC 

CAAAGTAGTGAATAGACCATCGGGGCGGTCAGTAGTCAAAGA 
CGC 

C AAC AAAATTTCACTG ACAGGG AACTTTTTGAC ATCTTC AGAA 
AGT 

TCGTATTCAGTAGTCAATTGCCGAGCATCAATAATGGGGATTA 

TAG 

CAGAAGCAACAGTGGAAGTCACATCTACCAACTTTGCGGTCTC 
AG 

AAAAAGCATAAACAGTTCTACTACCGCCATTAGTGAAACTTTT 
CAA 

ATCGCCCAGTGGAGAAGAAAAAGGCACAGCGATACTAGCATT 
AGO 

GGGCAAGGATGCAACTTTATCAACCAGGGTCCTATAGATAACC 
CT 

AGCGCCTGGGATCATCCTTTGGACAACTCTTTCTGCCAAATCT 
AGG 

TCCAAAATCACTTCATTGATACCATTATTGTACAACTTGAGCA 
AGT 

TGTCGATCAGCTCCTCAAATTGGTCCTCTGTAACGGATGACTC 
AAC 

TTGCACATTAACTTGAAGCTCAGTCGATTGAGTGAACTTGATC 
AGG 
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TTGTGCAGCTGGTCAGCAGCATAGGGAAACACGGCTTTTCCTA 

CCA 

AACTCAAGGAATTATCAAACTCTGCAACACTTGCGTATCKIAGG 
TA 

5 GCAAGGGAAATGTCATACTTGAAGTCGGACAGTGAGTGTAGT 

CTT 

GAGAAATTCTGAAGCCGTATmTATTATCAGTGAGTCAGTCA 
TCA 

GGAGATCCTCTACGCCGGACGCATCGTGGCCGACCTGCAGGTC 
10 GG 

CATCACCGGCGCCACAGGTGCGGTTGCTGGCGCCTATATCGCC 
GAC 

ATCACCGATGGGGAAGATCGGGCTCGCCACTTCGGGCTCATGA 
GC 

15 GCTTGTTTCGGCGTGGGTATGGTGGCAGGCCCCGTGGCCGGGG 

GA 

CTGTTGGGCGCCATCTCCTTGGACCTGCAGGGGGGGGGGGGG 
AAA 

GCCACGTTGTGTCTCAAAATCTCTGATGTTACATTGCACAAGA 
20 TAA 

AAATATATCATCATGAACAATAAAACTGTCTGCTTACATAAAC 
AGT 

AATACAAGGGGTGTTATGAGCCATATTCAACGGGAAACGTCTT 
GC 

25 TCAAGGCCGCGATTAAATTCCAACATGGATGCTGATTTATATG 

GGT 

ATAAATGGGCTCGCGATAATGTCGGGCAATCAGGTGCGACAA 
TCT 

ATCGATTGTATGGGAAGCCCGATGCGCCAGAGTTGTTTCTGAA 
30 ACA 
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TGGCAAAGGTAGCGTTGCCAATGATGTTACAGATGAGATGGTC 

AG 

ACTAAACTGGCTGACGGAATTTATGCCTCTTCCGACCATCAAG 
CAT 

5 TTTATCCGTACTCCTGATGATGCATGGTTACTCACCACTGCGAT 

CC 

CCGGGAAAACAGCATTCCAGGTATTAGAAGAATATCCTGATTC 
AG 

GTGAAAATATTGTTGATGCGCTGGCAGTGTTCCTGCGCCGGTT 
10 GCA 

TTCGATTCCTGTTTGTAATTGTCCTTTTAACAGCGATCGCGTAT 
TTC 

GTCTCGCTCAGGCGCAATCACGAATGAATAACGGTTTGGTTGA 
TGC 

15 GAGTGATTTTGATGACGAGCGTAATGGCTGGCCTGTTGAACAA 

GTC 

TGGAAAGAAATGCATAAGCTTTTGCCATTCTCACCGGATTCAG 
TCG 

TCACTCATGGTGATTTCTCACTTGATAACCITATTTTTGACGAG 
20 GG 

GAAATTAATAGGTTGTATTGATGTTGGACGAGTCGGAATCGCA 
GA 

CCGATACCAGGATCTTGCCATCCTATGGAACTGCCTCGGTGAG 
TTt 

25 TCTCCTTCATTACAGAAACGGCTTTTTCAAAAATATGGTATTG 

ATA 

ATCCTGATATGAATAAATTGCAGTrrCATTTGATGCTCGATGA 
GTT 

TTTCTAATCAGAATTGGTTAATTGGTTGTAACACTGGCAGAGC 
30 ATT 
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ACGCTGACTTGACGGGACGGCGGCTTTGTTGAATAAATCGAAC 

TTT 

TGCTGAGTTGAAGGATCAGATCACGCATCTTCCCGACAACGCA 
GA 

CCGTTCCGTGGCAAAGCAAAAGTTCAAAATCACCAACTGGTCC 
AC 

CTACAACAAAGCTCTCATCAACCGTGGCTCCCTCACTTTCTGG 

CTG 

GATGATGGGGCGATTCAGGCCTGGTATGAGTCAGCAACACCTT 
CTT 

CACGAGGCAGACCTCAGCGCCCCCCCCCCCCTGCAGGTCCCAC 
GG 

CGGCGGTGCTCAACGGCCTCAACCTACTACTGGGCTGCTTCCT 
AAT 

GCAGGAGTCGCATAAGGGAGAGCGTCGAGTATCTATGATTGG 
AAG 

TATGGGAATGGTGATACCCGCATTCTTCAGTGTCTTGAGGTCT 
CCT 

ATCAGATTATGCCCAACTAAAGCAACCGGAGGAGGAGATTTC 
ATG 

GTAAATTTCTCTGACTTTTGGTCATCAGTAGACTCGAACTGTG 
AGA 

CTATCTCGGTTATGACAGCAGAAATGTCCTTCTTGGAGACAGT 

AAA 

TGAAGTCCCACCAATAAAGAAATCCTTGTTATCAGGAACAAAC 
TTC 

TTGTTTCGAACTTTTTCGGTGCCTTGAACTATAAAATGTAGAGT 
GG 

ATATGTCGGGTAGGAATGGAGCGGGCAAATGCTTACCTTCTGG 
AC 
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CTTCAAGAGGTATGTAGGGTTTGTAGATACTGATGCCAACTTC 
AGT 

GACAACGTTGCTATTTCGTTCAAACCATTCCGAATCCAGAGAA 
ATC 

AAAGTTGTTTGTCTACTATTGATCCAAGCCAGTGCGGTCTTGA 
AAC 

TGACAATAGTGTGCTCGTGTTTTGAGGTCATCTTTGTATGAATA 
AA 

TCTAGTCTTTGATCTAAATAATCTTGACGAGCCAAGGCGATAA 
ATA 

CCCAAATCTAAAACTCTTTTAAAACGTTAAAAGGACAAGTATG 
TCT 

GCCTGTATTAAACCCCAAATCAGCTCGTAGTCTGATCCTCATC 
AAC 

TTGAGGGGCACTATCTTGTTTTAGAGAAATTTGCGGAGATGCG 
ATA 

TCGAGAAAAAGGTACGCTGATTTTAAACGTGAAATTTATCTCA 
AG 

ATCTCTGCCTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTG 
ACA 

CATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGAT 
GC 

CGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGG 
CGG 

GTGTCGGGGCGCAGCCATGACCCAGTCACGTAGCGATAGCGG 
AGT 

GTATACTGGCTTAACTATGCGGCATCAGAGCAGATTGTACTGA 
GA 

GTGCACCATATGCGGTGTGAAATACCGCACAGATGCGTAAGG 
AGA 
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AAATACCGCATCAGGCGCTCTTCCGCTTCCTCGCTCACTGACT 
CGC 

TGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTC 
AAA 

GGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGG 
AAA 

GAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAA 
AA 

AGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGA 
CGA 

GCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCC 
GAG 

AGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTC 
GTG 

CGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGC 
CTT 

TCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGT 
AGG 

TATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTG 
TGC 

ACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAA 
CTA 

TCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTG 
GCA 

GCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGC 
GGT 

GCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTA 
GA 

AGGACAGTATrTGGTATCTGCGCTCTGCTGAAGCCAGTTACCT 
TCG 
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GAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCG 

CTG 

GTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAG 
AAA 

AAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCT 
GAC 

GCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGA 
GA 

TTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAAT 
GAA 

■ 

GTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAG 

TTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTA 

TTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTAC 

GATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACC 

GCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCA 

GCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATC 

CGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGT 

AGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAG 

GCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTC 

CGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGC 
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AAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTA 

AGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAA 

TTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTG 

AGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGA 

GTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAG 

CAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGA 

AAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAAC 

CCACTCGTGCACCCAACTGATCITCAGCATCTTTTACTTTCACCAGC 

GTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAA 

GGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTT 

TTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCG 

GATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTC 

CGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCA 

TTATTATCATGACATTAACCTATAAAAATAGGCGTATCACGAGGCC 

CTTTCGTCTTCAAGAATTAATTCTCATGTTTGACAGCTTATCATCGA 
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TAAGCTGACTCATGTTGGTATTGTGAAATAGACGCAGATCGGGAA 
CACTGAAAAATAACAGTTATTATTCGAGATC 

PEAG65 

CTAAATTGTAAGCGTTAATATTTTGTTAAAATTCGCGTTAAATTTTT 

7 

GTTAAATCAGCTCATTTTTTAACCAATAGGCCGAAATCGGCAAAAT 

CCCTTATAAATCAAAAGAATAGACCGAGATAGGGTTGAGTGTTGT 
SEQID 

TCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTGGACTCCAA 

N0:35 

CGTCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGCCCACTACG 

TGAACCATCACCCTAATCAAGTTTTTTGGGGTCGAGGTGCCGTAAA 

GCACTAAATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGA 

CGGGGAAAGCCGGCGAACGTGGCGAGAAAGGAAGGGAAGAAAGC 

GAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCGGTCACGCT 

GCGCGTAACCACCACACCCGCCGCGCTTAATGCGCCGCTACAGGG 

CGCGTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCG 

ATCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGG 

ATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAG 
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TCACGACGTTGTAAAACGACGGCCAGTGAGCGCGCGTAATACGAC 

TCACTATAGGGCGAATTGGGTACCGGGCCCTCTAGATCCTTTCAGC 

TCCCTGCCCCGGACATGCCCAGTGGGTGGAAGCTGCCCTCTTCTAG 

CAGGAGACGCCCCAGGCGGTAGAGCAGCTGGGGGTACCAATGCAC 

ACCCTCCCCCGGaGTCCAGCTGCCCCATGCCAAGCTGTGAAAGAGT 

CTCAGGGGCCAGAAGGCCAACTGAGCCAGGTGGTGGTCAGCCACG 

GCCGCGAAGCAGGATGCCACCACATCCTCCACCACCAGTGTCC 
CA 

TGCTTTGTGAGCGGGGCGTAGGCCCCGAGGGCCACGTGTGTAG 
AG 

ACAGCTGCCACGCGGGCAGGCTGCAGGCCTGGCACCCCAGCC 
ACC 

AGCACGTACTGGCCAGGCTGCACGTGGCTGGCAAATGTGGCC 
CGG 

AAGCGGGCTGCCGGCTCCGTGTGATTGTCAGCCGTAAAGAGC 
AGG 

TGAGCGGGTGTGAGTGCCAGGCGGCGTGGGGGGTCCTGAGTC 
TCG 

ATGACCTGGAAGGCTCTCAGCCTGTGGGGCTCGCGGTCCAGGA 
AA 

ATGAGCACATCGCTGAAGGTGGGGCTCCCATCCTCCCCCATGG 
CCA 

GCACACGGTCTCCCGGCCTCACGGCTGACAAGGCCACACGCG 

CCC 
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CACTCTccaggcgtacctgggc^cggccgcgaateagccgcccgtcttggCTGCGGCC 
GAG 

TGCTCGGACTTGACGGAGCAATGCACGTGGGCCTTTGACTCGT 
AAT 

ACACCCAGTCAAAGCCGGCCTCCACTGCCAAGCGCGCCAGCA 
GTC 

CATACTTATTGCGGTCGCGGTCTGATGTGGTGATGTCCACCGC 
GCG 

GCCCTCATAATGCAGGGACTCCTCTGAGTGGTGGCCGTCCTCG 
TCC 

CAGCCCTCGGTCACCCGCAGCTTCACACCGGGCCACTGGTTCA 
TCA 

CCGAGATAGCCAGCGAGTTCAGGCGGTCCTTGCAGCGCTGGGT 
CA 

TGAGGCGGTCGGCGCCTGTGTTCTCCTCGTCCTTGAAGATGAT 
GTC 

TGGATTGTAATTGGGGGTGAGCTCCTTGAAGCGCTCGGAGCTG 
CG 

AGCGATCTTGCCTTCATAGCGTCCGCTGGCGCCCAGGGTCTTC 
TCG 

GGCACATTGGGGCTGAACTGCTTGTAGGCGAGCGGCACGAGT 
TTG 

CGTGGCGGTCGCCGGCGGCTGCCCACCACCCGACCCGGCCCGC 
AG 

CCCCATGCCGCcGGCACCACCAGCAGCAGCAACAGGACCAGG 
CAG 

AAGTGCAGTCGGGGCCGGAGCCGggcgggagacatggcggccgcgacggtat 
cgata 

agcTTGATATCGAATTCCTGCAGCCCGGGGGATCCACTAGTTCT 
AGA 
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GCGGCCGCCACCGCGGTGGAGCTCCAGCnTTGTTCCCTTTAG 
TGA 

GGGTTAATTGCGCGCTTGGCGTAATCATGGTCATAGCTGTTTC 
CTG 

TGTGAAATTGTTATCCGCTCACAATTCCACACAACATACGAGC 
CGG 

AAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTA 

ACT 

CACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGA 
AAC 

CTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGA 
GA 

GGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGA 
CTC 

GCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCAC 
TCA 

AAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCA 
GGA 

AAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGT 
AA 

AAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCT 
GAG 

GAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAAC 

CCG 

ACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCC 
TCG 

TGCGCTCTCCTGITCCGACCCTGCCGCTTACCGGATACCTGTCC 
GC 

CirrCTCCCrrCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCT 
GT 
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AGGTATCTCAOTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCT 
GTG 

TGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGG 
TAA 

CTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCA 
CTG 

GCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTA 

GGC 

GGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACA 
CTA 

GAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTAC 
CTT 

CGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCAC 

CGC 

TGGTAGCGGTGGTXrmTGTTTGCAAGCAGCAGATTACGCGCAGA 

AAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTG 

ACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGA 

GATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATG 

AAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGAC 

AGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTC 

TATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACT 

ACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATA 
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CCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAAC 

CAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTA 

TCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAA 

GTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTAC 

AGGCATCGTGGTGTCACGCTCGTCGITTGGTATGGCTTCATTCAGC 

TCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGT 

GCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAG 

TAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCAT 

AATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGG 

TGACGCGTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACC 

GAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACAT 

AGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGC 

GAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTA 

ACCCACTCGTGCACCCAACTGATCTTCAGCATCrrTTACTTTCACCA 

GCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAA 
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AGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCC 
TTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGC 

5 

GGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTT 
CCGCGCACATTTCCCCGAAAAGTGCCAC 

pEAG65 

10 CrAAATTGTAAGCGTTAATATTTTGTTAAAATTCGCGTTAAATTTTT 

GTTAAATCAGCTCATITTTTAACCAATAGGCCGAAATCGGCAAAAT 

CCCTTATAAATCAAAAGAATAGACCGAGATAGGGTTGAGTGTTGT 
15 SEQ ED 

TCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTGGACTCCAA 

N0:36 

CGTCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGCCCACTACG 
20 . TGAACCATCACCCTAATCAAGTITTTTGGGGTCGAGGTGCCGTAAA 

GCACTAAATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGA 
CGGGGAAAGCCGGCGAACGTGGCGAGAAAGGAAGGGAAGAAAGC 

25 

GAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCGGTCACGCT 

GCGCGTAACCACCACACCCGCCGCGCTTAATGCGCCGCTACAGGG 

30 CGCGTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCG 
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ATCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGG 

ATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAG 

TCACGACGTTGTAAAACGACGGCCAGTGAGCGCGCGTAATACGAC 

TCACTATAGGGCGAATTGGGTACCGGGCCCTCTAGATCCTTTCAGC 

TCCCTGCCCCGGACATGCCCAGTGGGTGGAAGCTGCCCTCTTCTAG 

CAGGAGACGCCCCAGGCGGTAGAGCAGCTGGGGGTACCAATGCAC 

ACCCTCCCCCGGaGTCCAGCTGCCCCATGCCAAGCTGTGAAAGAGT 

CTCAGGGGCCAGAAGGCCAACTGAGCCAGGTGGTGGTCAGCCACG 

GCCGCGAAGCAGGATGCCACCACATCCTCCACCACCAGTGTCC 
CA 

TGCTTTGTGAGCGGGGCGTAGGCCCCGAGGGCCACGTGTGTAG 
AG 

ACAGCTGCCACGCGGGCAGGCTGCAGGCCTGGCACCCCAGCC 
ACC 

AGCACGTACTGGCCAGGCTGCACGTGGCTGGCAAATGTGGCC 
CGG 

AAGCGGGCTGCCGGCTCCGTGTGATTGTCAGCCGTAAAGAGC 
AGG 

TGAGCGGGTGTGAGTGCCAGGCGGCGTGGGGGGTCCTGAGTC 
TCG 

ATGACCTGGAAGGCTCTCAGCCTGTGGGGCTCGCGGTCCAGGA 
AA 
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ATGAGCACATCGCTGAAGGTGGGGCTCCCATCCTCCCCCATGG 
CCA 

GCACACGGTCTCCCGGCCTCACGGCTGACAAGGCCACACGCG 
CCC 

CACTCTCCAGGCGTACCTgggctccggcagggtcgacgccgcccgtcttggCTGC 
GG 

CCGAGTGCTCGGACTTGACGGAGCAATGCACGTGGGCCTTTGA 
CTC 

GTAATACACCCAGTCAAAGCCGGCCTCCACTGCCAAGCGCGCC 
AG 

CAGTCCATACTTATTGCGGTCGCGGTCTGATGTGGTGATGTCC 
ACC 

GCGCGGCCCTCATAATGCAGGGACTCCTCTGAGTGGTGGCCGT 

ccn 

CGTCCCAGCCCTCGGTCACCCGCAGCTTCACACCGGGCCACTG 
GTT 

CATCACCGAGATAGCCAGCGAGTTCAGGCGGTCCTTGCAGCGC 
TG 

GGTCATGAGGCGGTCGGCGCCTGTGTTCTCCTCGTCCTTGAAG 

ATG 

ATGTCTGGATTGTAATTGGGGGTGAGCTCCTTGAAGCGCTCGG 
AGC 

TGCGAGCGATCTTGCCTTCATAGCGTCCGCTGQCGCCCAGGGT 
CTT 

CTCGGGCACATTGGGGCTGAACTGCTTGTAGGCGAGCGGCAC 
GAG 

TTTGCGTGGCGGTCGCCGGCGGCTGCCCACCACCCGACCCGGC 
CCG 

CAGCCCCATGCCGCcGGCACCACCAGCAGCAGCAACAGGACC 
AGG 
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CAGAAGTGCAGTCGGGGCCGGAGCCGggcgggagacatggcggccgcgac 
ggtat 

cgataagcTTGATATCGAATTCCTGCAGCCCGGGGGATCCACTAGT 
TCT 

AGAGCGGCCGCCACCGCGGTGGAGCTCCAGCTTTTGTTCCCTT 
TAG 

TGAGGGTTAATTGCGCGCTTGGCGTAATCATGGTCATAGCTGT 

TTC 

CTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATACG 
AGO 

CGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAG 
CTA 

ACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGG 
GA 

AACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGG 
GG 

AGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCA 
CTG 

ACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGC 
TCA 

CTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAA 
CGC 

AGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGA 
ACC 

GTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCC 
CCC 

TGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCG 
AAA 

CCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGC 
TC 
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CCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACC 

TGT 

CCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCA 
CG 

CTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTG 
GGC 

TGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTAT 

CCG 

GTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATC 
GCC 

ACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTA 
TGT 

AGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGC 
TA 

CACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCA 
GTT 

ACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAA 
ACC 

ACCGCrGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTA 

CGC 

GCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCmTCTAC 
GG 

GGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTT 
GG 

TCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAA 
TTA 

AAAATGAAGTriTAAATCAATCTAAAGTATATATGAGTAAACT 
TGG 

TCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAG 
CGA 

-209- 



PCT/USOl/19435 



TCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTG 
TAG 

ATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTG 
CA 

ATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAG 
CA 

ATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCT 

GCA 

ACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGC 
TA 

GAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGC 
CAT 

TGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCT 
TCA 

TTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCC 
CCA 

TGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGT 
TGT 

CAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCA 
GCA 

CTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTC 
TGT 

GACTGGTGACGCGTCAACCAAGTCATTCTGAGAATAGTGTATG 

CG 

GCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACC 
GC 

GCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGT 
TCT 

TCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCA 
GTT 

-210- 



wo 01/98344 PCTAJSOl/19435 

CGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTT 

AC 

TTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAA 
TGC 

5 CGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACT 

CAT 

ACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATT 
GTC 

TCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACA 
10 AATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCAC 



Construction of Hedgehog-Ig Fusion Proteins 

Shh-Fc(muIgGl) plasmid pUBl 14 (SEQ ID NO: 32), has the wild-^e SHH 
IS domain (SEQ ID NO: 21 or 23) fused to the CH2 and CH3 regions of murine IgGI 
(SEQ ID NO: 29). 

The Fc region in pUBlH contJiins a glycosylation site mutation [Asn297Gln]. 
Plasmid pUB5S (SEQ ID NO: 31)andpUB 1 14 plasmids are identical outside of tiie 
region coding for the Fc domain fiised to SHH. Plasmids identical to pUB 114, but 

20 containing the human IgGI or murine IgG2a Fc region are pUB 115 (SEQ ID NO: 33) 
and pUB 116 (SEQ ID NO: 34), respectively. 

For construction of yeast strains exprrasing protein, plasmids >vere digested with 
Stul and transformed into Pichia pastoris GSl IS by electcoporation in IM Sorbitol 
(Invitrogen) or by a Li salt transformation procedure (Frozen EZ Yeast Transformation 

25 kit, Zymo Research, Orange, CA). His+ transformants were selected on MD agar. 

Colonies were purified on YPD agar and cultured for protein expression in 5 ml BMMY 
(2% Methanol) medium. BMMY cxilture sv5)ematants were harvested at 1 or 2 days (1- 
day harvests w^ concentrated by TCA precipitation) and were analyzed by SDS- 
PAGE and Coomassie blue staining to distinguish clipped and undipped SHE 
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Protein Purification 

Large scale preparations of protein for purification were prepared as follows: An 
inoculum in BMGY (late log to stationary phase) was added to 1 L BMGY in a 
Fembach flask and incubated at 150 ipm for 2-3 days. The stationary phase BMGY 

5 cvdture was centrifuged and the cell pellet firom 1 L was resuspended in BMMY(2% 
Methanol) and incubated in a Fembach flask at 30 C for 2-3 days. Pepstatin A (44 
microM) was added to EMMY medium for expression of SHH-Fc fusion proteins. 
A. Purification of Hedgehog-Ig fusion protein constructs 
Pichia cells were removed firom the conditioned medium by centrifiigation 

10 before application to Protein A Fast Flow ® (Pharmacia). Protein from constructs 

utilizing human IgGI (SEQ ID NO: 28) or murme IgG2A sequences (SEQ ID NO: 30) 
were applied directly to the Protein A. Constructs utilizing murine IgG 1 sequences 
were diluted ten-fold with water to reduce the salt concentration, re-concentrated using a 
3K cutoff spiral filter (Amicon) and the pH adjusted with the addition of sodium borate 

15 buffer, pH 8.S to a final concentration of 50 mM. 

HHIg was eluted with 25 mM sodium phosphate, pH 2.8, and the firactions 
collected into tubes containing 0.1 volume of 0.5 M sodium phosphate pH 6 to readjust 
the pH, The Protein A eluant was then diluted eight-fold with 0.5 mM sodium 
phosphate, pH 6 and applied to a CM-Poros® column (Perseptive Biosystems) 

20 equilibrated with 50 mM sodium phosphate, pH 6.0. Elution with a gradient of 0-0.8 M 
NaCI separated two HHIg peaks. 

The first is "one-armed" protein in which one of the HHIg pplypeptides of the 
dimer is proteolytically cleaved at a sequence near the hinge and therefore this dimer 
contains only one HH N-terminal domain. The second peak is the dimer with two fiiU- 

25 lengdi HHIg chains. The peaks were pooled separately, reduced with 1 0 mM DTT and 
dialyzed against 5 mM sodium phosphate, pH 5.5, 150 mM NaCI and 0.5 mM DTT. No 
DTT was used when the N-tenninal cysteine of the protein was replaced with other 
amino acids. These two purification steps achieve >9S% purity. Purity was determined 
by SDS-PAGE on 4-20% gradient gels (Novex) stained witii Coomassie Blue. Identity 
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was confirmed by mass spectrometry, and potency was analyzed using a cell-based 
bioactivity assay (see above). 
Mass spectrometry 

The molecular masses of the purified proteins were determined by electrospray 
5 ionization mass spectroscopy (ESI-MS) on a Micromass Quattro n triple qxiadrupole 
mass spectrometer. Samples were desalted using an on-line Michrom Ultrafast 
Microprotein Analyzer system with a Reliasil© C4 colwnn (1 mm x 5 cm). All 
electrospray mass spectral data were processed using the Micromass MassLynx data 
system. 
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We claim: 

1. A method of promoting angiogenesis in a subject animal comprising administering 
to the subject an angiogenic amount of a hedgehog polypeptide or agonist thereof. 

5 

2. The method of claim 1, wherein the step of administering comprises contacting the 
hedgehog polypeptide or agonist with a mesenchymal cell of the subject 

3. The method of claim 1 , comprising administering to the subject a polypeptide 

10 including a hedgehog amino acid sequence, which hedgehog sequence directs the binding 
of the polypeptide to a patched receptor polypeptide and/or induces alkaline phosphatase 
activity in C3H10T1/2 cells. 

4. The method of claim 1, comprising administering to the subject a polypeptide 

15 including a hedgehog amino acid sequence having at least 60% amino acid identity with 
SEQIDNo. 10-18 or 20. 

5. The method of claim 1 , comprising administering to the subject a polypeptide 
including a hedgehog amino acid sequence encoded by a coding sequence which 

20 hybridizes imder stringent conditions to any of SEQ ID No. 1-9 or 19. 

6. The method of claim 1 , comprising administering to the subject a polypeptide 
including a hedgehog amino acid sequence represented by SEQ ID No. 26. 

25 7, The method of any of claims 3-7, wherein the hedgehog sequence includes at 
least 50 resdiues of an extracellular domain of a hedgehog protein. 

8. The method of any of claims 3-7, wherein the polypeptide is derivatized with one 
or more chemical moieties. 

30 

9. The method of claim 8, wherein the chemical moiety is a polyalkylene glycol 
polymer. 

1 0. The method of claim 8, wherein the chemical moiety is a hydrophobic moiety. 

1 1 . The method of claim 1 0, wherein the hydrophobic moiety is a Upid. 

12. The method of claim 8, wherein tiie chemical moiety is one or more phosphate 
groups. 

40 

13. The method of claim 8, wherein the chemical moiety is one or more acetyl groups. 

14. The method of claim 8, wh^ein the chemical moiety is one or more sugar or 
carbohydrate groups. 

45 

1 5. The method of claim 8, whmin the chemical moieties are any combination of 
phosphate, acetyl, sugar, carbohydrate, or hydrophobic moieties. 
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1 6. The metibod of claim 4, wherein the metiiod further comprises administering an 
agent that enhances agonistic properties of tiie hedgehog therapeutic. 

17. The method of claim 16, wherein the agent is an angiogenic factor selected fix>m 
5 the group consisting of vascular endothelial growth factor (VEGF), hepatocyte growth 

factor (HGF), basic fibroblast growth factor (bFGF), angiopoietin 1, angiopoietin 2, 
monocyte chemotactic protein-l (MCP-1). 

18. A method of inhibiting angiogenesis in a subject animal comprising administering 
10 to the subject an antiangiogenic amount of a hedgehog antagonist. 

19. The method of claim 18, comprising administering a polypeptide including one or 
more antigen binding domains which bind to and inhibit hedgehog signalling. 

15 20. The method of claim 18, comprising administering a polypeptide including one or 
more antigen binding domains which bind to patched and inhibit hedgehog signalling. 

21. The method of claim 18, con:q)rising administering a polypeptide including one or 
more antigen binding domains which bind to smoothened and inhibit hedgehog signalling. 

20 

22. The method of claim 19, 20 or 21, wherein the antigen binding domain is part of a 
an antibody structure selected jfrom the group consisting of a humanized antibody 
homology, a human antibody homolog, a chimeric antibody homolog and fragments 
thereof. 

23. The method of claim 18, comprising administering a functional antagonist of a 
hedgehog therapeutic. 

24. The method of claim 18, or 20, wherein the subject has a condition selected from 
30 the group consisting of a malignant tumor, retinopathy, macular degeneration, a 

nonmalignant tumor, rheumatoid arthritis, osteoarthritis, neovascular glaucoma, keloids, 
Crohn's disease, ulcerative colitis, and psoriasis. 



25 



35 



25. The method of claim 1, wherein flie hedgehog agonist is a small organic molecule. 

26. The method of claim 25, wherein the hedgehog agonist has a molecular weight less 
than 2500 amu. 



27. The method of claim 25, wherein the hedgehog agonist is represented by general 
40 formula QCD): 



Formula Xn 



Ar* 

1 



. Y 



1 



I 



Y 

cy -224- 



1 
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wherein, as valence and stability permit. 

At and Ar' independently represent substituted or unsubstituted aryl or heteroaryl 

rings; 

Y, independently for each occurrence, may be absent or represent -N(R)-, -0-, 
5 or -^!e-; 

X can be selected from -C(=0)-, -C(=S>, -S(02K -S(0)-, -C(=NCN)-, - 
P(=0)(0R2)-, and a methylene group optionally substituted with 1-2 groups such as lower 
alkyl, alkenyl, or alkynyl groups; 

M represents, independently for each occurrence, a substituted or unsubstituted 
10 methylene group, such as -CH2-, -CHF-, -CHOH-, -CH(Me)-, -C(=0)-, etc., or two M 
taken together represent substituted or unsubstituted ethene or etiiyne; 

R represents, independently for each occurrence, H or substituted or unsubstituted 
aryl, heterocyclyl, heteroaryl, aralkyl, heteroaraUcyl, alkynyl, alkenyl, or alkyl, or two R 
taken together may form a 4- to 8-membered ring, e.g., with N; 
J 5 Cy and Cy' indepradenly represent substituted or unsubstituted aryl, heterocyclyl, 

heteroaryl, or cycloalkyl, including polycyclic groups; 

i represents, independently for each occurrence, an integer from 0 to 5, preferably 
from 0 to 2; and 

n, individually for each occurence, represents an integer from 0 to 10, preferably 
20 from 0 to 5. 

28. The method of any of claims 3 — 7, comprising administering a nucleic acid 
sequence encoding the polypeptide. 

25 29. The method of claim 29, wherein the nucleic acid sequences encoding the 
polypeptide are introduced via a viral vector, via hpofection, and/or as naked DNA. 

30. The method of claim 1 8, wherein the hedgehog antagonist is a small organic 
molecule. 

30 

3 1 . The method of claim 30, wherein the hedgehog antagonist has a molecular weight 
less than 2S00 amu. 

32. The method of claim 30, wherein the hedgehog antagonist is represented by one or 
35 more of formulas I — XI. 
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33. The me&od of claim 30, wherein the hedgehog antagonist is represented by 
general formula (I): 



I 

Y2 



wherein, as valence and stability permit, 

and R2, independentiy for each occurrence, represent H, lower alkyl, aryl (e.g., 

substituted or unsubstituted), aralkyl (e.g., substituted or unsubstituted, e.g., -(CH2)naryl), 
or heteroaryl (e.g., substituted or unsubstituted), or heteroaralkyl (e.g., substituted or 
unsubstituted, e.g., -(CH2)nheteroaralkyl-); 

L, independently for each occurrence, is absent or represents -(CH2)ii-alkyl, - 
alkenyl-, -alkynyl-, -(CH2)nalkenyl., -(CH2)nalkynyl-, '(CH2)nO(CH2)p-, - 
(CH2)nNR2(CH2)p-, -(CH2)nS(CH2)p-, "(CH2)nalkenyl(CH2)p-, -(CH2)nalkynyl(CH2)p-, 

-0(CH2)ii-, -NR2(CH2)n-, or -S(CH2)nS 

Xj and X2 can be selected, independently, from -N(Rg)-, -0-, -S-, -Se-, -N=N-, - 

ON=CH-, -(Rg)N-N(R8)-, -ONCRg)-, a heterocycle, or a direct bond between L and Yj or 

Y2, respectively; 

Yi and Y2 can be selected, independently, from -C(=0>-, -C(=S)-, -S(02)-, -S(0)-, 
-C(=NCN)-> -P(=0)(0R2)-, a het^oaromatic group, or a direct bond between Xi and Zj or 
X2 and Z2, respectively; 

Zi and Z2 can be selected, independently, from -NCRg)-, -0-, -S-, -Se-, -N=N-, - 
ON=CH-, -RgN-NRg-, -ONRg-, a heterocycle, or a direct bond between Yj or Y2, 
respectively, and L; 

Rg, independently for each occurrence, represents H, lower alkyl, -(CH2)naryl 

(e.g., substituted or unsubstituted), -(CH2)nheteroaryl (e.g., substituted or unsubstituted), 

or two Rg taken together may form a 4- to 8-membered ring, e.g., with X^ and Z\ or X2 

and Zi, which ring may include one or more carbonyls; 

p represents, independently for each occurrence, an integer from 0 to 10, preferably 
from 0 to 3; and 

n, individually for each occurence, represents an integer from 0 to 10, preferably from 0 to 
5. 
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Figure 1 : Alignment of N-teiminal fragments of Human Hedgehog Proteins 
1 

Indian CGPGRWGSR RRPPRK-LVP LAYKQFSPNV PBKTLGASGR YEGKIARSSB 
Sonic CGPGRGFG-K RRHPKK-LTP LAYKQFIPNV AEKTLGASGR YEGKISRNSE 
Desert CGPGRGPVGRRRYARKQLVPLLYKQFVPGV PERTLGASGP AEGRVARGSE 



51 

Indian RFKELTPNYN PDHFKDEEN TGADRLMTQR CKDRLNSLAI SVMNQWPGVK 

Sonic RFKELTPNYN PDIIFKDEEN TGADRLMTQR CKDKLNALAI SVMNQWPGVK 

Desert RFRDLVPNYN PDIIFKDEEN SGADRLMTER CKERVNALAI AVMNMWPGVR 



101 

Indian LRVTEGWDED GHHSEESLHY EGRAVDITTS DRDRNKYGLL ARLAVEAGFD 
Sonic LRVTEGWDED GHHSEESLHY EGRAVDITTS DRDRSKYGML ARLAVEAGFD 
Desert LRVTEGWDED GHHAQDSLHY EGRALDITTS DRDRNKYGLL ARLAVEAGFD 



151 

Indian WVYYESKAHV HCSVKSEHSA AAKTGG SEQIDNO:23 

Sonic WVYYESKAHI HCSVKAENSV AAKSGG SEQIDN0.24 

Desert WVYYESRNHVHVSVKADNSL AVRAGG SEQIDN0.25 



Gap(s), indicated by added to facilitate alignment 
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Figure 2: SEQ ID NO: 26 is the consensus sequence of a hedgehog protein suitable for 
use in developing the conjugated proteins of the invention^ antagonist, where **Xaa" 
indicates amino acids that differ between the Sonic, Indian and Desert hedgehog proteins. 

C*G P G RXaalXaa2Xaa3Xaa4Xaa5 R R Xaa6 Xaa7 Xaa8 K Xaa9 L XaalO P 

L XaallY K Q F Xaal2 PXaal3 V Xaal4 EKTLGA SGR 

XaalSE GKXaal6Xaal7RXaal8 S E RFKXaal9LXaa20PN YN 



PDIIFKDEEN 

CKXaa23 Xaa24Xaa25NSLAI 

LRVTEGWDED 
EGRAVDITTS 

ARLAVEAGFD 

H Xaa36 S V K Xaa37 Xaa38 



Xaa21 G. A D R L M T Xaa22 R 

Xaa26VMNXaa27WPGVK 

G H H X2aa8 Xaa29 Xaa30 S LH Y 
DRDRXaa31 KYGXaa32L 

W V Y Y E S Xaa33 Xaa34 H Xaa35 

Xaa39 S Xaa40 A A Xaa41 Xaa42 G G 



Where 

C* is a cysteine that may be modified, altered or substituted within another moiety or 
series of moieties as described herein; 



Xaal is either V or G; 
Xaa4 is either S or G; 
Xaa7 is either P or A; 

XaalO is either V or 
Xaal2 is either S, I or V; 
Xaal 5 is either Y or A; 
Xaal8 is either S, N or G; 
Xaa21 is either T or S; 
Xaa24 is either R or K; 
Xaa27 is either Q or M; 
XaaSO is either E or D; 
Xaa33 is either K or R; 
Xaa36 is either C or V; 
Xaa39 is either H or N; 
Xaa42 is either T, S or A. 



Xaa2 is either V, E or P 
XaaS is either R or K; 
Xaa8 is eitherRor K; 
T; Xaal 1 is either A or 
Xaal3 is either N or G; 
Xaal 6 is either I or V; 
Xaal 9 is either E or D; 
Xaa22 is either Q or E; 
Xaa25 is either L or V; 
Xaa28 is either S or A; 
Xaa31 is either N or S; 
Xaa34 is either A or N; 
Xaa37 is either S or A; 
Xaa40 is either A, V or L; 



Xaa3 is either G or V 
Xaa6 is either P, H or Y; 
Xaa9 is any amino acid; 



L; 



Xaal4 is either P or A; 
Xaal7 is either A or S; 
Xaa20 is either Tor V; 
Xaa23 is either D or E; 
Xaa26 is either S or A; 
Xaa29 is either E or Q; 
Xaa32 is either L or M; 
Xaa35iseitherVorI; 
Xaa38 is either E or D; 
Xaa41 is either K or R; and 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: l.,1275 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

ATG GTC GAA ATG CTG CTG TTG ACA AGA ATT CTC TTG GTG GGC TTC ATC 48 
Met Val Glu Met Leu Leu Leu Thr Arg lie Leu Leu Val Gly Phe lie 
1 5 10 .15 

TGC GCT CTT TTA GTC TCC TCT GGG CTG ACT TGT GGA CCA GGC AGG GGC 96 
Cys Ala Leu Leu Val Ser Ser Gly Leu Thr Cys Gly Pro Gly Arg Gly 
20 25 30 

ATT GGA AAA AGG AGG CAC CCC AAA AAG CTG ACC CCG TTA GCC TAT AAG 144 
lie Gly Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys 
35 40 45 

CAG TTT ATT CCC AAT GTG GCA GAG AAG ACC CTA GGG GCC AGT GGA AGA 192 
Gin Phe lie Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg 
50 55 60 

TAT GAA GGG AAG ATC ACA AGA AAC TCC GAG AGA TTT AAA GAA CTA ACC 240 
Tyr Glu Gly Lys lie Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr 
65 70 75 80 

CCA AAT TAC AAC CCT GAC ATT ATT TTT AAG GAT GAA GAG AAC ACG GGA 288 
Pro Asn Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Gly 

85 90 95 

GCT GAC AGA CTG ATG ACT CAG CGC TGC AAG GAC AAG CTG AAT GCC CTG 336 
Ala Asp Arg Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu 
100 105 110 

GCG ATC TCG GTG ATG AAC CAG TGG CCC GGG GTG AAG CTG CGG GTG ACC 384 
Ala He Ser Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg Val Thr 
115 120 125 

GAG GGC TGG GAC GAG GAT GGC CAT CAC TCC GAG GAA TCG CTG CAC TAC 432 
Glu Gly Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr 
130 135 140 

GAG GOT CGC GCC GTG GAC ATC ACC ACG TCG GAT CGG GAC CGC AGC AAG 480 
Glu Gly Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Arg Ser Lys 
145 150 155 160 
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TAG GGA ATG CTG GCC CGC CTC GCC GTC GAG GCC GGC TTC GAC TGG GTC 528 
Tyr Gly Met Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val 

165 170 175 

TAG TAG GAG TCC AAG GCG CAC ATC CAC TGG TGG GTG AAA GCA GAA AAC 576 
Tyr Tyr Glu Ser Lys Ala His lie His Gys Ser Val Lys Ala Glu Asn 
180 185 190 

TCA GTG GGA GCG AAA TCA GGA GGC TGG TTC CCT GGC TCA GCC ACA GTG 624 
Ser Val Ala Ala Lys Ser Gly Gly Gys Phe Pro Gly Ser Ala Thr Val 
195 200 205 

CAC CTG GAG CAT GGA GGC ACC AAG CTG GTG AAG GAC CTG AGG CCT GGG 672 
His Leu Glu His Gly Gly Thr Lys Leu Val Lys Asp Leu Ser Pro Gly 
210 215' 220 

GAC CGC GTG CTG GCT GCT GAC GCG GAC GGC GGG CTG CTC TAG AGT GAG 720 
Asp Arg Val Leu Ala Ala Asp Ala Asp Gly Arg Leu Leu Tyr Ser Asp 
225 230 235 240 

TTG GTC ACC TTC CTC GAC GGG ATG GAC AGC TCC CGA AAG CTC TTC TAG 768 
Phe Leu Thr Phe Leu Asp Arg Met Asp Ser Ser Arg Lys Leu Phe Tyr 

245 250 255 

GTC ATC GAG ACG GGG GAG CGC GGG GCC CGG CTG CTA CTG ACG GCG GCC 816 
Val lie Glu Thr Arg Gin Pro Arg Ala Arg Leu Leu Leu Thr Ala Ala 
260 265 270 

CAC CTG CTC TTT GTG GCC CCC GAG CAC AAC GAG TGG GAG GCC ACA GGG 864 
His Leu Leu Phe Val Ala Pro Gin His Asn Gin Ser Glu Ala Thr Gly 
275 280 285 

TCC ACC AGT GGC CAG GCG CTC TTC GCC AGC AAC GTG AAG CCT GGC CAA 912 
Ser Thr Ser Gly Gin Ala Leu Phe Ala Ser Asn Val Lys Pro Gly Gin 
290 295 300 

CGT GTC TAT GTG GTG GGG GAG GGC GGG GAG CAG CTG CTG GCG GCG TGT 960 
Arg Val Tyr Val Leu Gly Glu Gly Gly Gin Gin Leu Leu Pro Ala Ser 
305 310 315 320 

GTC CAC AGC GTC TCA TTG CGG GAG GAG GCG TCC GGA GCC TAG GCC CCA 1008 
Val His Ser Val Ser Leu Arg Glu Glu Ala Ser Gly Ala Tyr Ala Pro 

325 330 335 

GTC ACC GCG CAG GGC ACC ATC CTC ATC AAC CGG GTG TTG GCC TCC TGC 1056 
Leu Thr Ala Gin Gly Thr lie Leu lie Asn Arg Val Leu Ala Ser Gys 
340 345 350 

TAG GCC GTC ATC GAG GAG CAG AGT TGG GCG CAT TGG GCC TTC GCA CCA 1104 
Tyr Ala Val lie Glu Glu His Ser Trp Ala His Trp Ala Phe Ala Pro 
355 360 365 

TTC CGC TTG GCT CAG GGG CTG CTG GCC GCC CTC TGC CCA GAT GGG GCG 1152 
Phe Arg Leu Ala Gin Gly Leu Leu Ala Ala Leu Gys Pro Asp Gly Ala 
370 375 380 

ATC CCT ACT GCC GCC ACC ACC ACC ACT GGC ATC CAT TGG TAG TCA CGG 1200 
lie Pro Thr Ala Ala Thr Thr Thr Thr Gly He His Trp Tyr Ser Arg 
385 390 395 400 
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CTC CTC TAG CGC ATC GGC AGO TGG GTG CTG GAT GGT GAG GCG CTG CAT 1248 
Leu Leu Tyr Arg lie Gly Ser Trp Val Leu Asp Gly Asp Ala Leu His 

405 410 415 

COG CTG GGC ATG GTG GCA CCG GCC AGC TG 1277 
Pro Leu Gly Met Val Ala Pro Ala Ser 
420 425 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1190 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1191 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATG GCT CTG CCG GCC AGT CTG TTG CCC CTG TGC TGC TTG GCA CTC TTG 48 
Met Ala Leu Pro Ala Ser Leu Leu Pro Leu Cys Cys Leu Ala Leu Leu 
15 10 15 

GCA CTA TCT GCC CAG AGC TGC GGG CCG GGC CGA GGA CCG GTT GGC CGG 96 
Ala Leu Ser Ala Gin Ser Cys Gly Pro Gly Arg Gly Pro Val Gly Arg 
20 25 30 

CGG CGT TAT GTG CGC AAG CAA CTT GTG OCT CTG CTA TAG AAG CAG TTT 144 
Arg Arg Tyr Val Arg Lys Gin Leu Val Pro Leu Leu Tyr Lys Gin Phe 
35 40 45 

GTG CCC AGT ATG CCC GAG CGG ACC CTG GGC GCG AGT GGG CCA GCG GAG 192 
Val Pro Ser Met Pro Glu Arg Thr Leu Gly Ala Ser Gly Pro Ala ciu 
50 55 60 

GGG AGG GTA ACA AGG GGG TCG GAG CGC TTC CGG GAC CTC GTA CCC AAC 240 
Gly Arg Val Thr Arg Gly Ser Glu Arg Phe Arg Asp Leu Val Pro Asn 
65 70 75 80 

TAG AAC CCC GAC ATA ATC TTC AAG GAT GAG GAG AAC AGC GGC GCA GAC 288 
Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Ser Gly Ala Asp 

85 90 95 

CGC CTG ATG ACA GAG CGT TGC AAA GAG CGG GTG AAC GCT CTA GCC ATC 336 
Arg Leu Met Thr Glu Arg Cys Lys Glu Arg Val Asn Ala Leu Ala lie 
100 105 110 

GCG GTG ATG AAC ATG TGG CCC GGA GTA CGC CTA CGT GTG ACT GAA GGC 384 
Ala Val Met Asn Met Trp Pro Gly Val Arg Leu Arg Val Thr Glu Gly 
115 120 125 



3 



wo 01/98344 PCT/USOl/19435 



TGG GAC GAG GAC GGC CAC CAC GCA CAG GAT TCA CTC CAC TAG GAA GGC 432 
Trp Asp Glu Asp Gly His His Ala Gin Asp Ser Leu His Tyr Glu Gly 
130 135 140 

CGT GCC TTG GAC ATC ACC ACG TCT GAC CGT GAC CGT AAT AAG TAT GGT 480 
Arg Ala Leu Asp lie Thr Thr Ser Asp Arg Asp Arg Asn Lys Tyr Gly 
145 150 155 160 

TTG TTG GOG GGC CTA GCT GTG GAA GCC GGA TTC GAC TGG GTC TAC TAC 528 
Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr 

165 170 175 

GAG TCC CGC AAC CAC ATC CAC GTA TCG GTC AAA GCT GAT AAC TCA CTG 576 
Glu Ser Arg Asn His He His 'Val Ser Val Lys Ala Asp Asn Ser Leu 
180 185 190 

GCG GTC CGA GCC GGA GGC TGC TTT CCG GGA AAT GCC ACG GTG CGC TTG 624 
Ala Val Arg Ala Gly Gly Cys Phe Pro Gly Asn Ala Thr Val Arg Leu 
195 200 205 

"CGG AGC GGC GAA CGG AAG GGG CTG AGG GAA CTA CAT CGT GGT GAC TGG 672 
Arg Ser Gly Glu Arg Lys Gly Leu Arg Glu Leu His Arg Gly Asp Trp 
210 215 220 

GTA CTG GCC GCT GAT GCA GCG GGC CGA GTG GTA CCC ACG CCA GTG CTG 720 
Val Leu Ala /Via Asp Ala Ala Gly Arg Val Val Pro Thr Pro Val Leu 
225 230 235 240 

CTC TTC CTG GAC CGG GAT CTG CAG CGC CGC GCC TCG TTC GTG GCT GTG 7 68 

Leu Phe Leu Asp Arg Asp Leu Gin Arg Arg Ala Ser Phe Val Ala Val 

245 250 255 

GAG ACC GAG CGG CCT CCG CGC AAA CTG TTG CTC ACA CCC TGG CAT CTG 816 
Glu Thr Glu Arg Pro Pro Arg Lys Leu Leu Leu Thr Pro Trp His Leu 
260 265 270 

GTG TTC GCT GCT CGC GGG CCA GCG CCT GCT CCA GGT GAC TTT GCA CCG 864 
Val Phe Ala Ala Arg Gly Pro Ala Pro Ala Pro Gly Asp Phe Ala Pro 
275 280 285 

GTG TTC GCG CGC CGC TTA CGT GCT GGC GAC TCG GTG CTG GCT CCC GGC 912 
Val Phe Ala Arg Arg Leu Arg Ala Gly Asp Ser Val Leu Ala Pro Gly 
290 295 300 

GGG GAC GCG CTC CAG CCG GCG CGC GTA GCC CGC GTG GCG CGC GAG GAA 960 
Gly Asp Ala Leu Gin Pro Ala Arg Val Ala Arg Val Ala Arg Glu Glu 
305 310 315 320 

GCC GTG GGC GTG TTC GCA CCG CTC ACT GCG CAC GGG ACG CTG CTG GTC 1008 
Ala Val Gly Val Phe Ala Pro Leu Thr Ala His Gly Thr Leu Leu Val 

325 330 335 

AAC GAC GTC CTC GCC TCC TGC TAC GCG GTT CTA GAG AGT CAC CAG TGG 1056 
Asn Asp Val Leu Ala Ser Cys Tyr Ala Val Leu Glu Ser His Gin Trp 
340 345 350 

GCC CAC CGC GCC TTC GCC CCT TTG CGG CTG CTG CAC GCG CTC GGG GCT 1104 
Ala His Arg Ala Phe Ala Pro Leu Arg Leu Leu His Ala Leu Gly Ala 
355 360 365 



4 



wo 01/98344 



PCTAJSOl/19435 



CTG CTC CCT GGG GGT GCA GTC CAG CCG ACT GGC ATG CAT TGG TAG TCT 1152 
Leu Leu Pro Gly Gly Ala Val Gin Pro Thr Gly Met His Trp Tyr Ser 
370 375 380* 

CGC CTC CTT TAC CGC TTG GCC GAG GAG TTA ATG GGC TG 1190 
Arg Leu Leu Tyr Arg L^u Ala Glu Glu Leu Met Gly 
385 390 395 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSf both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1,.1233 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG TCT CCC GCC TGG CTC CGG CCC CGA CTG CGG TTC TGT CTG TTC CTG 48 
Met Ser Pro Ala Trp Leu Arg Pro Arg Leu Arg Phe Cys Leu Phe Leu 
1 5 10 . 15 

CTG CTG CTG CTT CTG GTG CCG GCG GCG CGG GGC TGC GGG CCG GGC CGG 96 
Leu Leu Leu Leu Leu Val Pro Ala Ala Arg Gly Cys Gly Pro Gly Arg 
20 25 30 

GTG GTG GGC AGC CGC CGG AGG CCG CCT CGC AAG CTC GTG CCT CTT GCC 144 
Val Val Gly Ser Arg Arg Arg Pro Pro Arg Lys Leu Val Pro Leu Ala 
35 40 45 

TAC AAG CAG TTC AGC CCC AAC GTG CCG GAG AAG ACC CTG GGC GCC AGC 192 
Tyr Lys Gin Phe Ser Pro Asn Val Pro Glu Lys Thr Leu Gly Ala Ser 
50 55 60 

GGG CGC TAC GAA GGC AAG ATC GCG CGC AGC TCT GAG CGC TTC AAA GAG 240 
Gly Arg Tyr Glu Gly Lys lie Ala Arg Ser Ser Glu Arg Phe Lys Glu 
65 70 75 80 

CTC ACC CCC AAC TAC AAT CCC GAC ATC ATC TTC AAG GAC GAG GAG AAC 288 
Leu Thr Pro Asn Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn 

85 , 90 95 

ACG GGT GCC GAC CGC CTC ATG ACC CAG CGC TGC AAG GAC CGT CTG AAC 336 
Thr Gly Ala Asp T^g Leu Met Thr Gin Arg Cys Lys Asp Arg Leu Asn 
100 105 110 

TCA CTG GCC ATC TCT GTC ATG AAC CAG TGG CCT GGT GTG AAA CTG CGG 384 
Ser Leu Ala lie Ser Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg 
115 120 125 
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GTG ACC GAA GGC CGG GAT GAA GAT GGC CAT CAC TCA GAG GAG TCT TTA 432 
Val Thr Glu Gly Arg Asp Glu Asp Gly His His Ser Glu Glu Ser Leu 
130 135 140 ■ 

CAC TAT GAG GGC CGC GCG GTG GAT ATC ACC ACC TCA GAC CGT GAC CGA 480 
His Tyr Glu Gly Arg Ala Val Asp lie Thr Thr Ser Asp Arg Asp Arg 
145 150 155 160 

AAT AAG TAT GGA CTG CTG GCG CGC TTA GCA GTG GAG GCC GGC TTC GAC 528 
Asn Lys Tyr Gly Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp 

165 170 175 

TGG GTG TAT TAC GAG TCC AAG GCC CAC GTG CAT TGC TCT GTC AAG TCT 576 
Trp Val Tyr Tyr Glu Ser Lys Ala His Val His Cys Ser Val Lys Ser 
180 185 190 

GAG CAT TCG GCC GCT GCC AAG ACA GGT GGC TGC TTT CCT GCC GGA GCC 624 
Glu His Ser Ala Ala Ala Lys Thr Gly Gly Cys Phe Pro Ala Gly Ala 
195 200 205 

CAG GTG CGC CTA GAG AAC GGG GAG CGT GTG GCC CTG TCA GCT GTA AAG 672 
' Gin Val Arg Leu Glu Asn Gly Glu Arg Val Ala Leu Ser Ala Val Lys 
210 215 220 

CCA GGA GAC CGG GTG CTG GCC ATG GGG GAG GAT GGG ACC CCC ACC TTC 720 
Pro Gly Asp Arg Val Leu Ala Met Gly Glu Asp Gly Thr Pro Thr Phe 
225 230 235 240 

AGT GAT GTG CTT ATT TTC CTG GAC CGC GAG CCA AAC CGG CTG AGA GCT 768 
Ser Asp Val Leu lie Phe Leu Asp Arg Glu Pro Asn Arg Leu Arg Ala 

245 250 255 

TTC CAG GTC ATC GAG ACT CAG GAT CCT CCG CGT CGG CTG GCG CTC ACG 816 
Phe Gin Val lie Glu Thr Gin Asp Pro Pro Arg Arg Leu Ala Leu Thr 
260 265 270 

CCT GCC CAC CTG CTC TTC ATT GCG GAC AAT CAT ACA GAA CCA GCA GCC 864 
Pro Ala His Leu Leu Phe lie Ala Asp Asn His Thr Glu Pro Ala Ala 
275 280 285 

CAC TTC CGG GCC ACA TTT GCC AGC CAT GTG CAA CCA GGC CAA TAT GTG 912 
His Phe Arg Ala Thr Phe Ala Ser His Val Gin Pro Gly Gin Tyr Val 
290 295 300 

CTG GTA TCA GGG GTA CCA GGC CTC CAG CCT GCT CGG GTG GCA GCT GTC 960 
Leu Val Ser Gly Val Pro Gly Leu Gin Pro Ala Arg Val Ala Ala Val 
305 310 315 320 

TCC ACC CAC GTG GCC CTT GGG TCC TAT GCT CCT CTC ACA AGG CAT GGG 1008 
Ser Thr His Val Ala Leu Gly Ser Tyr Ala Pro Leu Thr Arg His Gly 

325 330 335 

ACA CTT GTG- GTG GAG GAT GTG GTG GCC TCC TGC TTT GCA GCT GTG GCT 1056 
Thr Leu Val Val Glu Asp Val Val Ala Ser Cys Phe Ala Ala Val Ala 
340 345 350 

GAC CAC CAT CTG GCT CAG TTG GCC TTC TGG CCC CTG CGA CTG TTT CCC 1104 
Asp His His Leu Ala Gin Leu Ala Phe Trp Pro Leu Arg Leu Phe Pro 
355 360 365 



6 



wo 01/98344 PCT/USOl/19435 

AGT TTG GCA TGG GGC AGC TGG ACC CCA AGT GAG GGT GTT CAC TCC TAG 1152 
Ser Leu Ala Trp Gly Ser Trp Thr Pro Ser Glu Gly Val His Ser Tyr 
370 375 380 

CCT CAG ATG CTC TAG CGC CTG GGG CGT CTC TTG CTA GAA GAG AGC ACC 1200 
Pro Gin Met Leu Tyr Arg Leu Gly Arg Leu Leu Leu Glu Glu Ser Thr 
385 390 395 400 

TTC CAT CCA CTG GGC ATG TCT GGG GCA GGA AGC TGAAGGGACT CTAACCACTG 1253 
Phe His Pro Leu Gly Met Ser Gly Ala Gly Ser 

405 • 410 

CCCTCCTGGA ACTGCTGTGC GTGGATCC ^^81 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1313 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1314 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATG CTG CTG CTG CTG GCC AGA TGT TTT CTG GTG ATC CTT GCT TCC TCG 48 
Met Leu Leu Leu Leu Ala Arg Cys Phe Leu Val He Leu Ala Ser Ser 



1 
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CTG CTG GTG TGC CCC GGG CTG GCC TGT GGG CCC GGC AGG GGG TTT GGA 96 
Leu Leu Val Cys Pro Gly Leu Ala Cys Gly Pro Gly Arg Gly Phe Gly 
20 25 30 

AAG AGG CGG CAC CCC AAA AAG CTG ACC CCT TTA GCC TAC AAG CAG TTT 144 
Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe 
35 40 45 

ATT CCC AAC GTA GCC GAG AAG ACC CTA GGG GCC AGC GGC AGA TAT GAA 192 
He Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu 
50 55 60 

GGG AAG ATC ACA AGA AAC TCC GAA CGA TTT AAG GAA CTC ACC CCC AAT 240 
Gly Lys He Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn 
65 70 75 80 

TAC AAC CCC GAC ATC ATA TTT AAG GAT GAG GAA AAC ACG GGA GCA GAC 288 
Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp 

85 90 95 

CGG CTG ATG ACT CAG AGG TGC AAA GAC AAG TTA AAT GCC TTG GCC ATC 336 
Arg Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu Ala He • 
100 105 
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TCT GTG ATG AAC GAG* TGG CCT GGA GTG AGG CTG CGA GTG ACC GAG GGC 384 
Ser Val Met Asn Gin Trp Pro Gly Val Arg Leu Arq Val Thr Glu Gly 
115 120 l25 

TGG GAT GAG GAG GGC CAT CAT TCA GAG GAG TCT CTA CAC TAT GAG GGT 432 
Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr Glu Gly 
130 135 140 

CGA GCA GTG GAC ATC ACC ACG TCC GAC CGG GAC CGC AGC AAG TAG GGC 480 
Arg Ala Val Asp lie Thr Thr Ser Asp Arg Asp Arg Ser Lys Tyr Gly 
145 150 155 160 

ATG CTG GCT CGC CTG GGT GTG GAA GCA GGT TTC GAC TGG GTG TAG TAT 528 
Met Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr 

165 * 170 175 

GAA TCC AAA GCT GAC ATG CAC TGT TCT GTG AAA GCA GAG AAC TGG GTG 576 
Glu Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn Ser Val 
180 185 190 

GCG GCG AAA TCC GGC GGC TGT TTC CCG GGA TCC GCC ACC GTG CAC CTG 624 
Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val His Leu 
195 200 205 

GAG CAG GGC GGC ACC AAG CTG GTG AAG GAC TTA CGT CCC GGA GAC CGC 672 
Glu Gin Gly Gly Thr Lys Leu Val Lys Asp Leu Arg Pro Gly Asp Arg 
210 215 220 

GTG CTG GCG GCT GAC GAC CAG GGC CGG CTG CTG TAG AGC GAC TTC CTG 720 
Val Leu Ala Ala Asp Asp Gin Gly Arg Leu Leu Tyr Ser Asp Phe Leu 
225 230 235 240 

ACC TTC CTG GAC CGC GAC GAA GGC GCC AAG AAG GTG TTC TAG GTG ATG 7 68 

Thr Phe Leu Asp Arg Asp Glu Gly Ala Lys Lys Val Phe Tyr Val lie 

245 250 255 

GAG ACG CTG GAG CCG CGG GAG CGG GTG GTG CTG ACC GCC GCG CAC CTG 816 
Glu Thr Leu Glu Pro Arg Glu Arg Xeu Leu Leu Thr Ala Ala His Leu 
260 265 270 

CTG TTC GTG GCG CGG GAG AAG GAC TGG GGG CCC ACG CCC GGG CCA AGC 864 
Leu Phe Val Ala Pro His Asn Asp Ser Gly Pro Thr Pro Gly Pro Ser 
275 280 285 

GCG CTG TTT GCC AGC CGC GTG CGC CCG GGG CAG CGC GTG TAG GTG GTG 912 
Ala Leu Phe Ala Ser Arg Val Arg Pro Gly Gin Arg Val Tyr Val Val 
290 295 300 

GCT GAA CGG GGC GGG GAG GGG CGG CTG GTG CCC GCC GCG GTG GAG AGC 960 
Ala Glu Arg Gly Gly Asp Arg Arg Leu Leu Pro Ala Ala Val His Ser 
305 310 315 320 

GTG ACG CTG GGA GAG GAG GAG GGG GGG GGG TAG GCG CGG CTG ACG GCG 1008 
Val Thr Leu Arg Glu Glu Glu Ala Gly Ala Tyr Ala Pro Leu Thr Ala 

325 330 335 _ 

CAC GGC ACC ATT CTG ATC AAC CGG GTG CTG GCC TGG TGC TAG GGT GTG 1056 
His Gly Thr lie Leu lie Asn Arg Val Leu Ala Ser Cys Tyr Ala Val 
340 345 350 
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ATC GAG GAG CAC AGC TGG GCA CAC CGG GCC TTC GCG OCT TTC CGC CTG 1104 
lie Glu Glu His Ser Trp Ala His Arg Ala Phe Ala Pro Phe Arg Leu 
355 360 '365 

GCG CAC GCG CTG CTG GCC GCG CTG GCA CCC GCC CGC ACG GAC GGC GGG 1152 
Ala His Ala Leu Leu Ala Ala Leu Ala Pro Ala Arg Thr Asp Gly Gly 
370 375 380 

GGC GGG GGC AGC ATC CCT GCA GCG CAA TCT GCA ACG GAA GCG AGG GGC 1200 
Gly Gly Gly Ser lie Pro Ala Ala Gin Ser Ala Thr Glu Ala Arg Gly 
385 390 395 400 

GCG GAG CCG ACT GCG GGC ATC CAC TGG TAC TCG CAG CTG CTC TAG CAC 1248 
Ala Glu Pro. Thr Ala Gly lie His Trp Tyr Ser Gin Leu Leu Tyr His 

405 410 415 

ATT GGC ACC TGG CTG TTG GAC AGC GAG ACC ATG CAT CCC TTG GGA ATG 1296 
lie Gly Thr Trp Leu Leu Asp Ser Glu Thr Met His Pro Leu Gly Met 
420 425 430 

GCG GTC AAG TCC AGC TG 1313 
' Ala Val Lys Ser Ser 
435 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1256 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1257 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



ATG CGG CTT TTG ACG AGA GTG CTG CTG GTG TCT CTT CTC ACT CTG TCC 48 
Met Arg Leu Leu Thr Arg Val Leu Leu Val Ser Leu Leu Thr Leu Ser 
15 10 15 

TTG GTG GTG TCC GGA CTG GCC TGC GGT CCT GGC AGA GGC TAC GGC AGA 96 
Leu Val Val Ser Gly Leu Ala Cys Gly Pro Gly Arg Gly Tyr Gly Arg 
20 • 25 30 

AGA AGA CAT CCG AAG AAG CTG ACA CCT CTC GCC TAC AAG CAG TTC ATA 144 
Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe He 
35 40 45 

CCT AAT GTC GCG GAG AAG ACC TTA GGG GCC AGC GGC AGA TAC GAG GGC 192 
Pro Asn Val Ala Glu Lys Thr Leu Gly 7U.a Ser Gly Arg Tyr Glu Gly 
50 55 60 



9 



wo 01/98344 PCTAJSOl/19435 

AAG ATA ACG CGC AAT TCG GAG AGA TTT AAA GAA CTT ACT CCA AAT TAC 240 
Lys lie Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn Tyr 
65 70 75 80 

AAT CCC GAC ATT ATC TTT AAG GAT GAG GAG AAC ACG GGA GCG GAC AGG 288 
Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp Arg 

85 90 95 

CTC ATG ACA CAG AGA TGC AAA GAC AAG CTG AAC TCG CTG GCC ATC TCT 336 
Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ser Leu Ala He Ser 
100 105 110 

GTA ATG AAC CAC TGG CCA GGG GTT AAG CTG CGT GTG ACA GAG GGC TGG 384 
Val Met Asn His Tip Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp 
115 ' 120 125 

GAT GAG GAC GGT CAC CAT TTT GAA GAA TCA CTC CAC TAC GAG GGA AGA 432 
Asp Glu Asp Gly His His Phe Glu Glu Ser Leu His Tyr Glu Gly Arg 
130 135 140 

GCT GTT GAT ATT ACC ACC TCT GAC CGA GAC AAG AGC AAA TAC GGG ACA 480 
Ala Val Asp He Thr Thr Ser Asp Arg Asp Lys Ser Lys Tyr Gly Thr 
145 150 155 160 

CTG TCT CGC CTA GCT GTG GAG GCT GGA TTT GAC TGG GTC TAT TAC GAG 528 
Leu Ser Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu 

165 170 175 

TCC AAA GCC CAC ATT CAT TGC TCT GTC AAA GCA GAA AAT TCG GTT GCT 576 
Ser Lys Ala His He His Cys Ser Val Lys Ala Glu Asn Ser Val Ala 
180 185 190 

GCG AAA TCT GGG GGC TGT TTC CCA GGT TCG GCT CTG GTC TCG CTC CAG 624 
Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Leu Val Ser Leu Gin 
195 200 205 

GAC GGA GGA CAG AAG GCC GTG AAG GAC CTG AAC CCC GGA GAC AAG GTG 672 
Asp Gly Gly Gin Lys Ala Val Lys Asp Leu Asn Pro Gly Asp Lys Val 
210 215 220 

CTG GCG GCA GAC AGC GCG GGA AAC CTG GTG TTC AGC GAC TTC ATC ATG 720 
Leu Ala Ala Asp Ser Ala Gly Asn Leu Val Phe Ser Asp Phe He Met 
225 230 235 240 

TTC ACA GAC CGA GAC TCC ACG ACG CGA CGT GTG TTT TAC GTC ATA GAA 7 68 

Phe Thr Asp Arg Asp Ser Thr Thr Arg Arg Val Phe Tyr Val He Glu 

245 250 255 

ACG CAA GAA CCC GTT GAA AAG ATC ACC CTC ACC GCC GCT CAC CTC CTT 816 
Thr Gin Glu Pro Val Glu Lys He Thr Leu Thr Ala Ala His Leu Leu 
260 265 270 

TTT GTC CTC GAC AAC TCA ACG GAA GAT CTC CAC ACC ATG ACC GCC GCG 864 
Phe Val Leu Asp Asn Ser Thr Glu Asp Leu His Thr Met Thr Ala Ala 
275 280 285 

TAT GCC AGC AGT GTC AGA GCC GGA CAA AAG GTG ATG GTT GTT GAT GAT 912 
Tyr Ala Ser Ser Val Arg Ala Gly Gin Lys Val Met Val Val Asp Asp 
290 295 300 
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AGC GGT CAG CTT AAA TCT GTC ATC GTG GAG CGG ATA TAG ACG GAG GAG 960 
Ser Gly Gin Leu Lys Ser Val He Val Gin Arg He Tyr Thr Glu Glu 
305 310 315 320 

CAG CGG GGC TCG TTC GCA CCA GTG ACT GCA CAT GGG ACC ATT GTG GTC 1008 
Gin Arg Gly Ser Phe Al^i Pro Val Thr Ala His Gly Thr He Val Val 

325 330 335 

GAC AGA ATA CTG GCG TCC TGT TAC GCC GTA ATA GAG GAC CAG GGG CTT 1056 
Asp Arg He Leu Ala Ser Cys Tyr Ala Val He Glu Asp Gin Gly Leu 
340 345 350 

GCG CAT TTG GCC TTC GCG CCC GCC AGG CTC TAT TAT TAC GTG TCA TCA 1104 
Ala His Leu Ala Phe Ala Pro Ala Arg Leu Tyr Tyr Tyr Val Ser Ser 
355 360 365 

TTC CTG TCC CCC AAA ACT CCA GCA GTC GGT CCA ATG CGA CTT TAC AAC 1152 
Phe Leu Ser Pro Lys Thr Pro Ala Val Gly Pro Met Arg Leu Tyr Asn 
370 375 380 

AGG AGG GGG TCC ACT GGT ACT CCA GGC TCC TGT CAT CAA ATG GGA ACG 1200 
'Arq Arg Gly Ser Thr Gly Thr Pro Gly Ser Cys His Gin Met Gly Thr 
385 390 395 400 

TGG CTT TTG GAC AGC AAC ATG CTT CAT CCT TTG GGG ATG TCA GTA AAC 1248 
Trp Leu Leu Asp Ser Asn Met Leu His Pro Leu Gly Met Ser Val Asn 

405 410 415 



TCA AGC TG 
Ser Ser 



1256 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1425 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ATG CTG CTG CTG GCG AGA TGT CTG CTG CTA GTC CTC GTC TCC TCG CTG 48 
Met Leu Leu Leu Ala Arg Cys Leu Leu Leu Val Leu Val Ser Ser Leu 
15 10 15 

CTG GTA TGC TCG GGA CTG GCG TGC GGA CCG GGC AGG GGG TTC GGG AAG 96 
Leu Val Cys Ser Gly Leu Ala Cys Gly Pro Gly Arg Gly Phe Gly Lys 
20 25 30 
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AGG AGG CAC CCC AAA AAG CTG ACC CCT TTA GCC TAG AAG CAG TTT ATC 144 
Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe lie 
35 40 * 45 

CCC AAT GTG GCC GAG AAG ACC CTA GGC GCC AGC GGA AGG TAT GAA GGG 192 
Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu Gly 
50 55 60 

AAG ATC TCC AGA AAC TCC GAG CGA TTT AAG GAA CTC ACC CCC AAT TAG 240 
Lys He Ser Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn Tyr 
65 70 75 80 

AAC CCC GAG ATC ATA TTT AAG GAT GAA GAA AAC ACC GGA GCG GAC AGG 288 
Asn Pro Asp He He Phe Lys Asp (?lu Glu Asn Thr Gly Ala Asp Arg 

85 ' 90 95 

CTG ATG ACT CAG AGG TGT AAG GAC AAG TTG AAC GCT TTG GCC ATC TCG 336 
Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu Ala He Ser 
100 105 110 

GTG ATG AAC CAG TGG CCA GGA GTG AAA CTG GGG GTG ACC GAG GGC TGG 384 
' Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp 
115 120 125 

GAC GAA GAT GGC CAC CAC TCA GAG GAG TCT CTG CAC TAC GAG GGC CGC 432 
Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr Glu Gly Arg 
130 135 140 

GCA GTG GAC ATC ACC ACG TCT GAC CGC GAC CGC AGC AAG TAC GGC ATG .480 
Ala Val Asp He Thr Thr Ser Asp Arg Asp Arg Ser Lys Tyr Gly Met 
145 150 155 160 

CTG GCC CGC CTG GCG GTG GAG GCC GGC TTC GAC TGG GTG TAC TAC GAG 528 
Leu Ala Arg Leu- Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu 

165 170 175 

TCC AAG GCA CAT ATC CAC TGC TCG GTG AAA GCA GAG AAC TCG GTG GCG 57 6 

Ser Lys Ala His He His Cys Ser Val Lys Ala Glu Asn Ser Val Ala 
180 185 190 

GCC AAA TCG GGA GGC TGC TTC CCG GGC TCG GCC ACG GTG CAC CTG GAG 624 
Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val His Leu Glu 
195 200 205 

CAG GGC GGC ACC AAG CTG GTG AAG GAC CTG AGC CCC GGG GAC CGC GTG 672 
Gin Gly Gly Thr Lys Leu Val Lys Asp Leu Ser Pro Gly Asp Arg Val 
210 215 220 

CTG GCG GCG GAC GAC CAG GGC CGG CTG CTC TAC AGC GAC TTC CTC ACT 720 
Leu Ala Ala Asp Asp Gin Gly Arg Leu Leu Tyr Ser Asp Phe Leu Thr 
225 230 235 240 

TTC CTG GAC CGC GAC GAC GGC GCC AAG AAG GTC TTC TAC GTG ATC GAG 768 
Phe Leu Asp Arg Asp Asp Gly Ala Lys Lys Val Phe Tyr Val He Glu 

245 250 255 

ACG CGG GAG CCG CGC GAG CGC CTG CTG CTC ACC GCC GCG CAC CTG CTC 816 
Thr Arg Glu Pro Arg Glu Arg Leu Leu Leu Thr Ala Ala His Leu Leu 
260 265 270 
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TTT GTG GCG CCG CAC AAC GAG TCG GCC AGO GGG GAG CCC GAG GCG TOG 864 
Phe Val Ala Pro His Asn Asp Ser Ala Thr Gly Glu Pro Glu Ala Ser 
275 280 285 

TCG GGG TCG GGG CCG CCT TCC GGG GGC GCA CTG GGG CCT CGG GCG CTG 912 
Ser Gly Ser Gly Pro Pro Ser Gly Gly Ala Leu Gly Pro Arg Ala I*eu 
290 295 300 

TTC GCC AGC CGC GTG CGC CCG GGC CAG CGC GTG TAC GTG GTG GCC GAG 960 
Phe Ala Ser Arg Val Arg Pro Gly Gin Arg Val Tyr Val Val Ala Glu 
305 -310 315 320 



CGT GAC GGG GAC CGC CGG CTC CTG CCC GCC GCT GTG CAC AGC GTG ACC 
Arg Asp Gly Asp Arg Arg Leu Leu Pro Ala Ala Val His Ser Val Thr 

325 330 335 



1008 



CTA AGC GAG GAG GCC GCG GGC GCC TAC GCG CCG CTC ACG GCC CAG GGC 1056 
Leu Ser Glu Glu Ala Ala Gly TUa Tyr Ala Pro Leu Thr Ala Gin Gly 
340 345 350 

ACC ATT CTC ATC AAC CGG GTG CTG GCC TCG TGC TAC GCG GTC ATC GAG 1104 
Thr lie Leu lie Asn Arg Val Leu Ala Ser Cys Tyr Ala Val He Glu 
355 360 365 

GAG CAC AGC TGG GCG CAC CGG GCC TTC GCG CCC TTC CGC CTG GCG CAC 1152 
Glu His Ser Trp Ala His Arg Ala Phe Ala Pro Phe Arg Leu Ala His 
370 375 380 

GCG CTC CTG GCT GCA CTG GCG CCC GCG CGC ACG GAC CGC GGC GGG GAC 1200 
Ala Leu Leu Ala Ala Leu Ala Pro Ala Arg Thr Asp Arg Gly Gly Asp 
385 390 395 400 

AGC GGC GGC GGG GAC CGC GGG GGC GGC GGC GGC AGA GTA GCC CTA ACC 1248 
Ser Gly Gly Gly Asp Arg Gly Gly Gly Gly Gly Arg Val Ala Leu Thr 

405 410 415 

GCT CCA GGT GCT GCC GAC GCT CCG GGT GCG GGG GCC ACC GCG GGC ATC 1296 
Ala Pro Gly Ala Ala Asp Ala Pro Gly Ala Gly Ala Thr Ala Gly He 
420 425 430 

CAC TGG TAC TCG CAG CTG CTC TAC CAA ATA GGC ACC TGG CTC CTG GAC 1344 
His Trp Tyr Ser Gin Leu Leu Tyr Gin He Gly Thr Trp Leu Leu Asp 
435 440 445 

AGC GAG GCC CTG CAC CCG CTG GGC ATG GCG GTC AAG TCC AGC NNN AGC 1392 
Ser Glu Ala Leu His Pro Leu Gly Met Ala Val Lys Ser Ser Xaa Ser 
450 455 460 

CGG GGG GCC GGG GGA GGG GCG CGG GAG GGG GCC 1425 
Arg Gly Ala Gly Gly Gly Ala Arg Glu Gly Ala 
465 470 475 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY.: CDS 

(B) LOCATION: 51.. 1283 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CATCAGCCCA CCAGGAGACC TCGCCCGCCG CTCCCCCGGG CTCCCCGGCC ATG TCT 56 

Met Ser 
1 

CCC GCC CGG CTC CGG CCC CGA CTG CAC TTC TGC CTG GTC CTG TTG CTG 104 
Pro Ala Arg Leu Arg Pro Arg Leu His Phe Cys Leu Val Leu Leu Leu 
5 10 15 

CTG CTG GTG GTG CCC GCG GCA TGG GGC TGC GGG CCG GGT CGG GTG GTG 152 
Leu Leu Val Val Pro Ala Ala Trp Gly Cys Gly Pro Gly Arg Val Val 
20 25 30 

GGC AGC CGC CGG CGA CCG CCA CGC AAA CTC GTG CCG CTC GCC TAG AAG 200 
Gly Ser Arg Arg Arg Pro Pro Arg Lys Leu Val Pro Leu Ala Tyr Lys 
35 40 45 50 

CAG TTC AGC CCC AAT GTG CCC GAG AAG ACC CTG GGC GCC AGC GGA CGC 248 
Gin Phe Ser Pro Asn Val Pro Glu Lys Thr Leu Gly Ala Ser Gly Arg 

55 60 65 

TAT GAA GGC AAG ATC GCT CGC AGC TCC GAG CGC TTC AAG GAG CTC ACC 296 
Tyr Glu Gly Lys lie Ala Arg Ser Ser Glu Arg Phe Lys Glu Leu Thr 
70 75 80 

CCC AAT TAG AAT CCA GAC ATC ATC TTC AAG GAC GAG GAG AAC ACA GGC 344 
Pro Asn Tyr Asn Pro Asp lie He Phe Lys Asp Glu Glu Asn Thr Gly 
85 90 95 

GCC GAC CGC CTC ATG ACC CAG CGC TGC AAG GAC CGC CTG AAC TCG CTG 392 
Ala Asp Arg Leu Met Thr Gin Arg Cys Lys Asp Arg Leu Asn Ser Leu 
100 105 110 

GCT ATC TCG GTG ATG AAC CAG TGG CCC GGT GTG AAG CTG CGG GTG ACC 440 
Ala He Ser Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg Val Thr 
115 120 125 130 

GAG GGC TGG GAC GAG GAC GGC CAC CAC TCA GAG GAG TCC CTG ' CAT TAT 488 
Glu Gly Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr 

135 • 140 145 

GAG GGC CGC GCG GTG GAC ATC ACC ACA TCA GAC CGC GAC CGC AAT AAG 536 
Glu Gly Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Arg Asn Lys 
150 155 160 

TAT GGA CTG CTG GCG CGC TTG GCA GTG GAG GCC GGC TTT GAC TGG GTG 584 
Tyr Gly Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val 
165 170 175 
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TAT TAG GAG TCA AAG GCC CAC GTG CAT TGC TCC GTC AAG TCC GAG CAC 632 
Tyr Tyr Glu Ser Lys Ala His Val His Cys Ser Val Lys Ser Glu His 
180 185 190 * 

TCG GCC GCA GCC AAG ACG GGC GGC TGC TTC CCT GCC GGA GCC CAG GTA 680 
Ser Ala Ala Ala Lys Thr Gly Gly Cys Phe Pro Ala Gly Ala Gin Val 
195 200 205 210 

CGC CTG GAG AGT GGG GCG CGT GTG GCC TTG TCA GCC GTG AGG CCG GGA 728 
Arg Leu Glu Ser Gly Ala Arg Val Ala Leu Ser Ala Val Arg Pro Gly 

215 220 225 

GAC CGT GTG CTG GCC ATG GGG GAG GAT GGG AGC CCC ACC TTC AGC GAT 776 
Asp Arg Val Leu Ala Met Gly Glu Asp Gly Ser Pro Thr Phe Ser Asp 
230 235 240 

GTG CTC ATT TTC CTG GAC CGC GAG CCC CAC AGG CTG AGA GCC TTC CAG 824 
Val Leu lie Phe Leu Asp Arg Glu Pro His Arg Leu Arg Ala Phe Gin 
245 250 255 

GTC ATC GAG ACT CAG GAC CCC CCA CGC CGC CTG GCA CTC ACA CCC GCT 872 
Val He Glu Thr Gin Asp Pro Pro Arg Arg Leu Ala Leu Thr Pro Ala 
260 265 270 

CAC CTG CTC TTT ACG GCT GAC AAT CAC ACG GAG CCG GCA GCC CGC TTC 920 
His Leu Leu Phe Thr Ala Asp Asn His Thr Glu Pro Ala Ala Arg Phe 
275 280 285 290 

CGG GCC ACA TTT GCC AGC CAC GTG CAG CCT GGC CAG TAC GTG CTG GTG 968 
Arg Ala Thr Phe Ala Ser His Val Gin Pro Gly Gin Tyr Val Leu Val 

295 300 305 

GCT GGG GTG CCA GGC CTG CAG CCT GCC CGC GTG GCA GCT GTC TCT ACA 1016 
Ala Gly Val Pro Gly Leu Gin Pro Ala Arg Val Ala Ala Val Ser Thr 
310 315 320 

CAC GTG GCC CTC GGG GCC TAC GCC CCG CTC ACA AAG CAT GGG ACA CTG 1064 
His Val Ala Leu Gly Ala Tyr Ala Pro Leu Thr Lys His Gly Thr Leu 
325 330 335 

GTG GTG GAG GAT GTG GTG GCA TCC TGC TTC GCG GCC GTG GCT GAC CAC 1112 
Val Val Glu Asp Val Val Ala Ser Cys Phe Ala Ala Val Ala Asp His 
340 345 350 

CAC CTG GCT CAG TTG GCC TTC TGG CCC CTG AGA CTC TTT CAC AGC TTG 1160 
His Leu Ala Gin Leu Ala Phe Trp Pro Leu Arg Leu Phe His Ser Leu 
355 360 365 370 

GCA TGG GGC AGC TGG ACC CCG GGG GAG GGT GTG CAT TGG TAC CCC CAG 1208 
Ala Trp Gly Ser Trp Thr Pro Gly Glu Gly Val His Trp Tyr Pro Gin 

375 380 385 

CTG CTC TAC CGC CTG GGG CGT CTC CTG CTA GAA GAG GGC AGC TTC CAC 1256 
Leu Leu Tyr Arg Leu Gly Arg Leu Leu Leu Glu Glu Gly Ser Phe His 
390 395 400 

CCA CTG GGC ATG TCC GGG GCA GGG AGC TGAAAGGACT CCACCGCTGC 1303 
Pro Leu Gly Met Ser Gly Ala Gly Ser 
405 410 
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CCTCCTGGAA CTGCTGTACT GGGTCCAGAA GCCTCTCAGC CAGGAGGGAG CTGGCCCT6G 1363 

AAGGGACCTG AGCTGGGGGA CACTGGCTCC TGCCATCTCC TCTGCCATGA AGATACACCA 1423 

TTGAGACTTG ACTGGGCAAC ACCAGCGTCC CCCACCCGCG TCGTGGTGTA GTCATAGAGC 1483 

TGCAAGCTGA GCTGGCGAGG GGATGGTTGT TGACCCCTCT CTCCTAGAGA CCTTGAGGCT 1543 

GGCACGGCGA CTCCCAACTC AGCCTGCTCT CACTAC6AGT TTTCATACTC TGCCTCCCCC 1603 

ATTGGGAGGG CCCATTCCC 1622 
(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1191 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: l.,1191 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ATG GCT CTC CTG ACC AAT CTA CTG CCC TTG TGC TGC TTG GCA CTT CTG 48 
Met Ala Leu Leu Thr Asn Leu Leu Pro Leu Cys Cys Leu Ala Leu Leu 
15 10 15 

GCG CTG CCA GCC CAG AGC TGC GGG CCG GGC CGG 6GG CCG GTT GGC CGG 96 
Ala Leu Pro Ala Gin Ser Cys Gly Pro Gly Arg Gly Pro Val Gly Arg 
20 25 30 

CGC CGC TAT GCG CGC AAG CAG CTC GTG CCG CTA CTC TAC AAG CAA TTT 144 
Arg Arg Tyr Ala Arg Lys Gin Leu Val Pro Leu Leu Tyr Lys Gin Phe 
35 40 45 

GTG CCC GGC GTG CCA GAG CGG ACC CTG GGC GCC AGT GGG CCA GCG GAG 192 
Val Pro Gly Val Pro Glu Arg Thr Leu Gly Ala Ser Gly Pro Ala Glu 
50 55 60 

GGG AGG GTG GCA AGG GGC TCC GAG CGC TTC CGG GAC CTC GTG CCC AAC 240 
Gly Arg Val Ala Arg Gly Ser Glu Arg Phe Arg Asp Leu Val' Pro Asn 
65 70 75 80 

4 

TAC AAC CCC GAC ATC ATC TTC AAG GAT GAG GAG AAC AGT GGA GCC GAC 288 
Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Ser Gly Ala Asp 

85 90 95 

CGC CTG ATG ACC GAG CGT TGC AAG GAG AGG GTG AAC GCT TTG GCC ATT 336 
Arg Leu Met Thr Glu Arg Cys Lys Glu Arg Val Asn Ala Leu Ala lie 
100 105 110 

GCC GTG ATG AAC ATG TGG CCC GGA GTG CGC CTA CGA GTG ACT GAG GGC 384 
Ala Val Met Asn Met Trp Pro Gly Val Arg Leu Arg Val Thr Glu Gly 
115 120 125 
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TGG GAC GAG GAC GGC CAC CAC GOT CAG GAT TCA CTC CAC TAG GAA GGC 432 
Trp Asp Glu Asp Gly His His Ala Gin Asp Ser Leu His Tyr Glu Gly 
130 135 . 140 

CGT GCT TTG GAC ATC ACT ACG TCT GAC CGC GAC CGC AAC AAG TAT GGG 480 
Arg Ala Leu Asp lie Thr Thr Ser Asp Arg Asp Arg Asn Lys Tyr Gly 
145 150 155 160 

TTG CTG GCG CGC CTC GCA GTG GAA GGC GGC TTC GAC TGG GTC TAC TAG 528 
Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr 

165 170 175 

GAG TCC CGC AAC CAC GTC CAC GTG TCG GTC AAA GCT GAT AAC TCA CTG 576 
Glu Ser Arg Asn His Val His Val Ser Val Lys Ala Asp Asn Ser Leu 
180 185 190 

GCG GTC CGG GCG GGC GGC TGC TTT CCG GGA AAT GCA ACT GTG CGC CTG 624 
Ala Val Arg Ala Gly Gly Cys Phe Pro Gly Asn Ala Thr Val Arg Leu 
195 200 205 

TGG AGC GGC GAG CGG AAA GGG CTG CGG GAA CTG CAC CGC GGA GAC TGG 672 
Trp Ser Gly Glu Arg Lys Gly Leu Arg Glu Leu His Arg Gly Asp Trp 
210 215 220 

GTT TTG GCG GCC GAT GCG TCA GGC CGG GTG GTG CCC ACG CCG GTG CTG 720 
Val Leu Ala Ala Asp Ala Ser Gly Arg Val Val Pro Thr Pro Val Leu 
225 230 235 240 

CTC TTC CTG GAC CGG GAC TTG CAG CGC CGG GCT TCA TTT GTG GCT GTG 768 
Leu Phe Leu Asp Arg Asp Leu Gin Arg Arg Ala Ser Phe Val Ala Val 

245 250 255 

GAG ACC GAG TGG CCT CCA CGC AAA CTG TTG CTC ACG CCC TGG CAC CTG 816 
Glu Thr Glu Trp Pro Pro Arg Lys Leu Leu Leu Thr Pro Trp His Leu 
260 265 270 

GTG TTT GCC GCT CGA GGG CCG GCG CCC GCG CCA GGC GAC TTT GCA CCG 864 
Val Phe Ala Ala Arg Gly Pro Ala Pro Ala Pro Gly Asp Phe Ala Pro 
275 280 285 

GTG TTC GCG CGC CGG CTA CGC GCT GGG GAC TCG GTG CTG GCG CCC GGC 912 
Val Phe Ala Arg Arg Leu Arg Ala Gly Asp Ser Val Leu Ala Pro Gly 
290 295 300 

GGG GAT GCG CTT CGG CCA GCG CGC GTG GCC CGT GTG GCG CGG GAG GAA 960 
Gly Asp Ala Leu Arg Pro Ala Arg Val Ala Arg Val Ala Arg Glu Glu 
305 310 315 320 

GCC GTG GGC GTG TTC GCG CCG CTC ACC GCG CAC GGG ACG CTG CTG GTG 1008 
Ala Val Gly Val Phe Ala Pro Leu Thr Ala His Gly Thr Leu Leu Val 

325 330 335 

AAC GAT GTC CTG GCC TCT TGC TAC GCG GTT CTG GAG AGT CAC CAG TGG 1056 
Asn Asp Val Leu Ala Ser Cys Tyr Ala Val Leu Glu Ser His Gin Trp 
340 345 350 

GCG CAC CGC GCT TTT GCC CCC TTG AGA CTG CTG CAC GCG CTA GGG GCG 1104 
Ala His Arg Ala Phe Ala Pro Leu Arg Leu Leu His Ala Leu Gly Ala 
355 360 365 
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CTG CTC CCC GGC GGG GCC GTC CAG CCG ACT GGC ATG CAT TGG TAG TCT 1152 

Leu Leu Pro Gly Gly Ala Val Gin Pro Thr Gly Met His Trp Tyr Ser 

370 375 380 ' 

CGG CTC CTC TAG CGC TTA GCG GAG GAG CTA CTG GGC TG 1191 

Arg Leu Leu Tyr Arg Lau Ala Glu Glu Leu Leu Gly 

385 390 395 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 base pairs 

(B) TYPE: nucleic* acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1248 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATG GAC GTA AGG CTG CAT CTG AAG CAA TTT GCT TTA CTG TGT TTT ATC 48 
Met Asp Val Arg Leu His Leu Lys Gin Phe Ala Leu Leu Cys Phe lie 
15 10 15 

AGC TTG CTT CTG ACG CCT TGT GGA TTA GCC TGT GGT CCT GGT AGA GGT 96 
Ser Leu Leu Leu Thr Pro Cys Gly Leu Ala Cys Gly Pro Gly Arg Gly 
20 25 30 

TAT GGA AAA CGA AGA CAC CCA AAG AAA TTA ACC CCG TTG GCT TAC AAG 14 4 

Tyr Gly Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys 
35 40 45 

CAA TTC ATC CCC AAC GTT GCT GAG AAA ACG CTT GGA GCC AGC GGC AAA 192 
Gin Phe lie Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly tys 
50 55 60 

TAC GAA GGC AAA ATC ACA AGG AAT TCA GAG AGA TTT AAA GAG CTG ATT 240 
Tyr Glu Gly Lys lie Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu lie 
65 70 75 80 

CCG AAT TAT AAT CCC GAT ATC ATC TTT AAG GAC GAG GAA AAC ACA AAC 288 
Pro Asn Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Asn 

85 90 95 

GCT GAC AGG CTG ATG ACC AAG CGC TGT AAG GAC AAG TTA AAT TCG TTG 336 
Ala Asp Arg Leu Met Thr Lys Arg Cys Lys Asp Lys Leu Asn Ser Leu 
100 105 110 

GCC ATA TCC GTC ATG AAC CAC TGG CCC GGC GTG AAA CTG CGC GTC ACT 384 
Ala He Ser Val Met Asn His Trp Pro Gly Val Lys Leu Arg Val Thr 
115 120 125 
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GAA GGC TGG GAT GAG GAT GGT CAC CAT TTA GAA GAA TCT TTG CAC TAT 432 
Glu Gly Trp Asp Glu Asp Gly His His Leu Glu Glu Ser Leu His Tyr 
130 135 140 ' 

GAG GGA CGG GCA GTG GAG ATC ACT ACC TCA GAC AGG GAT AAA AGC AAG 480 
Glu Gly Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Lys Ser Lys 
145 150 155 160 

TAT GGG ATG CTA TOO AGG CTT GCA GTG GAG GCA GGA TTC GAC TGG GTC 528 
Tyr Gly Met Leu Ser Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val 

165- 170' 175 

TAT TAT GAA TCT AAA GCC CAC ATA CAC TGC TCT GTC AAA GCA GAA AAT 576 
Tyr Tyr Glu Ser Lys Ala His He His Cys Ser Val Lys Ala Glu Asn 
180 ' 185 190 

TCA GTG GCT GCT AAA TCA GGA GGA TGT TTT CCT GGG TCT GGG ACG GTG 624 
Ser Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Gly Thr Val 
195 200 205 

ACA CTT GGT GAT GGG ACG AGG AAA CCC ATC AAA GAT CTT AAA GTG GGC 672 
' Thr Leu Gly Asp Gly Thr Arg Lys Pro He Lys Asp Leu Lys Val Gly 
210 215 220 

GAC CGG GTT TTG GCT GCA GAC GAG AAG GGA AAT GTC TTA ATA AGC GAC 720 
Asp Arg Val Leu Ala Ala Asp Glu Lys Gly Asn Val Leu He Ser Asp 
225 230 235 240 

TTT ATT ATG TTT ATA GAC CAC GAT CCG ACA ACG AGA AGG CAA TTC ATC 7 68 

Phe He Met Phe He Asp His Asp Pro Thr Thr Arg Arg Gin Phe He 

245 250 255 

GTC ATC GAG ACG TCA GAA CCT TTC ACC AAG CTC ACC CTC ACT GCC GCG 816 
Val He Glu Thr Ser Glu Pro Phe Thr Lys Leu Thr Leu Thr Ala Ala 
260 265 270 

CAC CTA GTT TTC GTT GGA AAC TCT TCA GCA GCT TCG GGT ATA ACA GCA 864 
His Leu Val Phe Val Gly Asn Ser Ser Ala Ala Ser Gly He Thr Ala 
275 280 285 

ACA TTT GCC AGC AAC GTG AAG CCT GGA GAT ACA GTT TTA GTG TGG GAA 912 
Thr Phe Ala Ser Asn Val Lys Pro Gly Asp Thr Val Leu Val Trp Glu 
290 295 300 

GAC ACA TGC GAG AGC CTC AAG AGC GTT ACA GTG AAA AGG ATT TAC ACT ' 960 
Asp Thr Cys Glu Ser Leu Lys Ser Val Thr Val Lys Arg He Tyr Thr 
305 310 315 320 

GAG GAG CAC GAG GGC TCT TTT GCG CCA GTC ACC GCG CAC GGA ACC ATA 1008 
Glu Glu His Glu Gly Ser Phe Ala Pro Val Thr Ala His Gly Thr He 

325 330 335 

ATA GTG GAT CAG GTG TTG GCA TCG TGC TAC GCG GTC ATT GAG AAC CAC 1056 
.He Val Asp Gin Val Leu Ala Ser Cys Tyr Ala Val He Glu Asn His 
340 • 345 350 

AAA TGG GCA CAT TGG GCT TTT GCG CCG GTC AGG TTG TGT CAC AAG CTG 1104 
Lys Trp Ala His Trp Ala Phe Ala Pro Val Arg Leu Cys His Lys Leu 
355 360 365 
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ATG ACG TGG CTT TTT CCG GCT CGT GAA 
Met Thr Trp Leu Phe Pro Ala Arg Glu 
370 375 

GAT GGT ATC CAC TGG TAG TCA AAT ATG 
Asp Gly He His Trp Tyr Ser Asn Met 
385 390 

CTG CTG GAG AGA GAG TCT TTC CAT CCA 
Leu Leu Asp Arg Asp Ser Phe His Pro 

405- 

TGA 



TCA AAC GTC AAT TTT GAG GAG 1152 
Ser Asn Val Asn Phe Gin Glu 
380 ' 

CTG TTT CAC ATC GGC TCT TGG 1200 
Leu Phe His He Gly Ser Trp 
395 400 

CTG GGG ATT TTA CAC TTA AGT 1248 
Leu Gly He Leu His Leu Ser 
410 415 

1251 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Val Glu Met Leu Leu Leu Thr Arg He Leu Leu Val Gly Phe He 
15 10 15 

Cys Ala Leu Leu Val Ser Ser Gly Leu Thr Cys Gly Pro Gly Arg Gly 
20 25 30 

He Gly Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys 
35 40 45 

Gin Phe He Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg 
50 55 60 

Tyr Glu Gly Lys He Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr 
65 70 75 ^80 

Pro Asn Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Thr Gly 

85 90 95 

Ala Asp Arg Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu 
100 105 110 

Ala He Ser Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg Val Thr 
115 . 120 125 

Glu Gly Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr 
130 135 140 

Glu Gly Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Arg Ser Lys 
145 150 155 160 

Tyr Gly Met Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val 

165 170 175 
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Tyr Tyr Glu Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn 
180 185 190 

Ser Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val 
195 200 205 

His Leu Glu His Gly Gly Thr Lys Leu Val Lys Asp Leu Ser Pro Gly 
210 215 220 

Asp Arg Val Leu Ala Ala Asp Ala Asp Gly Arg Leu Leu Tyr Ser Asp 
225 230 235 240 

Phe Leu Thr Phe Leu Asp Arg Met Asp Ser Ser Arg Lys Leu Phe Tyr 

245 250 255 

Val lie Glu Thr Arg Gin Pro Arg Ala Arg Leu Leu Leu Thr Ala Ala 
260 265 270 

His Leu Leu Phe Val Ala Pro Gin His Asn Gin Ser Glu Ala Thr Gly 
275 280 285 

Ser Thr Ser Gly Gin Ala Leu Phe Ala Ser Asn Val Lys Pro Gly Gin 
290 295 300 

Arg Val Tyr Val Leu Gly Glu Gly Gly Gin Gin Leu Leu Pro Ala Ser 
305 310 315 320 

Val His Ser Val Ser Leu Arg Glu Glu Ala Ser Gly Ala Tyr Ala Pro 

325 330 335 

Leu Thr Ala Gin Gly Thr lie Leu lie Asn Arg Val Leu Ala Ser Cys 
340 345 350 

Tyr Ala Val He Glu Glu His Ser Trp Ala His Trp Ala Phe Ala Pro 
355 360 365 

Phe Arg Leu Ala Gin Gly Leu Leu Ala Ala Leu Cys Pro Asp Gly Ala 
370 375 380 

lie Pro Thr Ala Ala Thr Thr Thr Thr Gly He His Trp Tyr Ser Arg 
385 390 395 400 

Leu Leu Tyr Arg He Gly Ser Trp Val Leu Asp Gly Asp Ala Leu His 

405 410 415 

Pro Leu Gly Met Val Ala Pro Ala Ser 
420 425 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION^ SEQ ID NO: 11: 
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Met Ala Leu Pro Ala 
1 5 



Ser Leu Leu Pro Leu Cys Cys Leu Ala Leu 

10 15 



Leu 



Ala Leu Ser Ala Gin Ser Cys Gly Pro Gly Arg Gly Pro Val Gly Arg 
20 . 25 30 

Arg Arg Tyr Val Arg Lys Gin Leu Val Pro Leu Leu Tyr Lys Gin Phe 
35 40 45 

Val Pro Ser Met Pro Glu Arg Thr Leu Gly Ala Ser Gly Pro Ala Glu 
50 55 60 

Gly Arg Val Thr Arg Gly Ser Glu Arg Phe Arg Asp Leu Val Pro Asn 
65 70 75 80 

Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Ser Gly Ala Asp 

85 90 95 

Arg Leu Met Thr Glu Arg Cys Lys Glu Arg Val Asn Ala Leu Ala lie 
100 105 110 

Ala Val Met Asn Met Trp Pro Gly Val Arg Leu Arg Val Thr Glu Gly 
115 120 125 

Trp Asp Glu Asp Gly His His Ala Gin Asp Ser Leu His Tyr Glu Gly 
130 135 140 

Arg Ala Leu Asp lie Thr Thr Ser Asp Arg Asp Arg Asn Lys Tyr Gly 
145 150 155 160 

Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr 

165 170 175 

Glu Ser Arg Asn His He His Val Ser Val Lys Ala Asp Asn Ser Leu 
180 185 190 

Ala Val Arg Ala Gly Gly Cys Phe Pro Gly A^n Ala Thr Val Arg Leu 
195 200 205 

Arg Ser Gly Glu Arg Lys Gly Leu Arg Glu Leu His Urg Gly Asp Trp 
210 215 220 

Val Leu Ala Ala Asp Ala Ala Gly Arg Val Val Pro Thr Pro Val Leu 
225 230 235 240 

Leu Phe Leu Asp Arg Asp Leu Gin Arg Arg Ala Ser Phe Val Ala Val 

245 250 255 

Glu Thr Glu Arg Pro Pro Arg Lys Leu Leu Leu Thr Pro Trp His Leu 
260 265 270 

Val Phe Ala Ala Arg Gly Pro Ala Pro Ala Pro Gly Asp Phe Ala Pro 
275 280 285 

Val Phe Ala Arg Arg Leu Arg Ala Gly Asp Ser Val Leu Ala Pro Gly 
290 295 300 

Gly Asp Ala Leu Gin Pro Ala Arg Val Ala Arg Val Ala Arg Glu Glu 



305 



310 



315 



320 



22 



wo 01/98344 PCT/USOl/19435 

Ala Val Gly Val Phe Ala Pro Leu Thr Ala His Gly Thr Leu Leu Val 

325 330 335 

Asn Asp Val Leu Ala Ser Cys Tyr Ala Val Leu Glu Ser His Gin Trp 
340 345 350 

* 

Ala His Arg Ala Phe Aia Pro Leu Arg Leu Leu His Ala Leu Gly Ala 
355 360 365 

Leu Leu Pro Gly Gly Ala Val Gin Pro Thr Gly Met His Trp Tyr Ser 
370 375 380 

T^g Leu Leu Tyr Arg Leu Ala Glu Glu Leu Met Gly 
385 390 395 



(2) INFORMATION FOR SEQ ID NO: 12; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 12: 

Met Ser Pro Ala Trp Leu Arg Pro Arg Leu Arg Phe Cys Leu Phe Leu 
15 10 15 

Leu Leu Leu Leu Leu Val Pro Ala Ala Arg Gly Cys Gly Pro Gly Arg 
20 25 30 

Val Val Gly Ser Arg Arg Arg Pro Pro Arg Lys Leu Val Pro Leu Ala 
35 40 45 

Tyr Lys Gin Phe Ser Pro Asn Val Pro Glu Lys Thr Leu Gly Ala Ser 
50 55 60 

Gly Arg Tyr Glu Gly Lys He Ala Arg Ser Ser Glu Arg Phe Lys Glu 
65 70 75 80 

Leu Thr Pro Asn Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn 

85 90 95 

Thr Gly Ala Asp Arg Leu Met Thr Gin Arg Cys Lys Asp Arg Leu Asn 
100 105 110 

« 

Ser Leu Ala He Ser Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg 
115 120 125 

Val Thr Glu Gly Arg Asp Glu Asp Gly His His Ser Glu Glu Ser Leu 
130 135 140 

His Tyr Glu Gly Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Arg 
145 150 155 160 

Asn Lys Tyr Gly Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp 

165 170 175 
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Trp Val Tyr Tyr Glu Ser Lys Ala His Val His Cys Ser Val I-ys Ser 
180 185 190 

Glu His Ser Ala Ala Ala Lys Thr Gly Gly Cys Phe Pro Ala Gly Ala 
195 200 205 

Gin Val Arg Leu Glu Asn Gly Glu Arg Val Ala Leu Ser Ala Val Lys 
210 215 220 

Pro Glv Asp Arg Val Leu Ala Met Gly Glu Asp Gly Thr Pro Thr Phe 
225 230 235 240 

Ser Asp Val Leu lie Phe Leu Asp Arg Glu Pro Asn Arg Leu Arg Ala 

245 • 250 255 

Phe Gin Val lie Glu Thr Gin Asp Pro Pro Arg Arg Leu Ala Leu Thr 
260 265 270 

Pro Ala His Leu Leu Phe He Ala Asp Asn His Thr Glu Pro Ala Ala 
275 280 • 285 

His Phe Arg Ala Thr Phe Ala Ser His Val Gin Pro Gly Gin Tyr Val 
290 295 300 

Leu Val Ser Gly Val Pro Gly Leu Gin Pro Ala Arg Val Ala Ala Val 
305 310 315 320 

Ser Thr His Val Ala Leu Gly Ser Tyr Ala Pro Leu Thr Arg His Gly 

325 330 335 

Thr Leu Val Val Glu Asp Val Val Ala Ser Cys Phe Ala Ala Val Ala 
340 345 350 

Asp His His Leu Ala Gin Leu Ala Phe Trp Pro Leu Arg Leu Phe Pro 
355 360 365 

Ser Leu Ala Trp Gly Ser Trp Thr Pro Ser Glu Gly Val His Ser Tyr 
370 375 380 

Pro Gin Met Leu Tyr Arg Leu Gly Arg Leu Leu Leu Glu Glu Ser Thr 
385 390 395 400 

Phe His Pro Leu Gly Met Ser Gly Ala Gly Ser 

405 410 



(2) INFORMATION FOR SEQ ID NO: 13: 

« 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
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Met Leu Leu Leu Leu Ala Arg Cys Phe Leu Val lie Leu Ala Ser Ser 
15 10 15 

Leu Leu Val Cys Pro Gly Leu Ala Cys Gly Pro Gly Arg Gly Phe Gly 
20 25 30 

Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe 
35 40 45 

lie Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu 
50 55 60 

Gly Lys He Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn 
65 70 75 80 

Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp 

85 90 95 

Arg Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu Ala He 
100 105 110 

Ser Val Met Asn Gin Trp Pro Gly Val Arg Leu Arg Val Thr Glu Gly 
115 120 125 

Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr Glu Gly 
130 135 140 

Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Arg Ser Lys Tyr Gly 
145 150 155 160 

Met Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr 

165 170 175 

Glu Ser Lys Ala His He His Cys Ser Val Lys Ala Glu Asn Ser Val 
180 185 190 

Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val His Leu 
195 200 205 

Glu Gin Gly Gly Thr Lys Leu Val Lys Asp Leu Arg Pro Gly Asp Arg 
210 215 220 

Val Leu Ala Ala Asp Asp Gin Gly Arg Leu Leu Tyr Ser Asp Phe Leu 
225 230 235 240 

Thr Phe Leu Asp Arg Asp Glu Gly Ala Lys Lys Val Phe Tyr Val He 

245 250 255 

Glu Thr Leu Glu Pro T^g Glu Arg Leu Leu Leu Thr Ala Ala His Leu 
260 . 265 270 

Leu Phe Val Ala Pro His Asn Asp Ser Gly Pro Thr Pro Gly Pro Ser 
275 280 285 

Ala Leu Phe Ala Ser Arg Val Arg Pro Gly Gin Arg Val Tyr Val Val 
290 295 300 

Ala Glu Arg Gly Gly Asp Arg Arg Leu Leu Pro Ala Ala Val His Ser 
305 310 315 320 
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Val Thr Leu Arg Glu Glu Glu Ma Gly Ala Tyr Ala Pro Leu Thr Ala 

325 330 335 

His Gly Thr lie Leu lie Asn Arg Val Leu Ala Ser Cys Tyr Ala Val 
340 345 350 

lie Glu Glu His Ser Trp Ala His Arg Ala Phe Ala Pro Phe Arg Leu 
355 360 365 

Ala His Ala Leu Leu Ala Ala Leu Ala Pro Ala Arg Thr Asp Gly Gly 
370 375 380 

Gly Gly Gly Ser lie Pro Ala Ala Gin Ser Ala Thr Glu Ala Arg Gly 
385 390 395 400 

Ala Glu Pro Thr Ala Gly He His Trp Tyr Ser Gin Leu Leu Tyr His 

405 410 415 

He Gly Thr Trp Leu Leu Asp Ser Glu Thr Met His Pro Leu Gly Met 
420 425 430 



'Ala Val Lys Ser Ser 
435 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Arg Leu Leu Thr Arg Val Leu Leu Val Ser Leu Leu Thr Leu Ser 
15 10 15 

Leu Val Val Ser Gly Leu Ala Cys Gly Pro Gly Arg Gly Tyr Gly Arg 
20 25 30 

Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe lie 
35 40 45 

Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu Gly 
50 55 60 

Lys He Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn Tyr 
65 70 75 80 

Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp Arg 

85 90 95 

Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ser Leu Ala He Ser 
100 105 110 

Val Met Asn His Trp Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp 
115 120 125 
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Asp 



Glu 
130 



Asp 



Gly His His 



Phe Glu Glu Ser Leu His Tyr Glu 
135 140 



Gly Arg 



Ala Val Asp lie Thr Thr Ser Asp Arg Asp Lys Ser Lys Tyr Gly Thr 
145 15p 155 160 

Leu Ser Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu 

165 170 175 

Ser Lys Ala His He His Cys Ser Val Lys Ala Glu Asn Ser Val Ala 
180 185 190 

Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Leu Val Ser Leu Gin 
195 -200 205 

Asp Gly Gly Gin Lys Ala Val Lys Asp Leu Asn Pro Gly Asp Lys Val 
210 215 220 

Leu Ala Ala Asp Ser Ala Gly Asn Leu Val Phe Ser Asp Phe He Met 
225 230 235 240 

Phe Thr Asp Arg Asp Ser Thr Thr Arg Arg Val Phe Tyr Val He Glu 

245 250 255 

Thr Gin Glu Pro Val Glu Lys He Thr Leu Thr Ala Ala His Leu Leu 
260 265 270 

Phe Val Leu Asp Asn Ser Thr Glu Asp Leu His Thr Met Thr Ala Ala 
275 280 285 • 

Tyr Ala Ser Ser Val Arg Ala Gly Gin Lys Val Met Val Val Asp Asp 
290 295 300 

Ser Gly Gin Leu Lys Ser Val He Val Gin Arg He Tyr Thr Glu Glu 
305 310 315 320 

Gin Arg Gly Ser Phe Ala Pro Val Thr Ala His Gly Thr He Val Val 

325 330 335 

Asp Arg He Leu Ala Ser Cys Tyr Ala Val He Glu Asp Gin Gly Leu 
340 345 350 

Ala His Leu Ala Phe Ala Pro Ala Arg Leu Tyr Tyr Tyr Val Ser Ser 
355 360 365 

Phe Leu Ser Pro Lys Thr Pro Ala Val Gly Pro Met Arg Leu Tyr Asn 
370 375 380 

Arg Arg Gly Ser Thr Gly Thr Pro Gly Ser Cys His Gin Met Gly Thr 
385 390 395 400 

Trp Leu Leu Asp Ser Asn Met Leu His Pro Leu Gly Met Ser Val Asn 

405 410 415 

Ser Ser 



(2) INFORMATION FOR SEQ ID NO: 15: 
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(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 475 amino acids 

(B) TYPE: aiaino acid 
(D) TOPOLOGY; linear 

(ii) MOLECULE TXPE: protein 



(xi) SEQUENCE DESCRIPTION: 

Met Leu Leu Leu 
1 

Leu Val Cys Ser 
20 

Arg Arg His Pro 
35 

Pro Asn Val Ala 
50 



SEQ ID NO: 15: 

Ser Leu 
15 

Gly Lys 
Phe lie 
Glu Gly 



Ala Arg Cys Leu Leu Leu Val Leu Val Ser 
5 10 

Gly Leu- Ala* Cys Gly Pro Gly Arg Gly Phe 

25 30 

Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin 
40 45 

Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr 
55 60 



Lys lie Ser Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn Tyr 
65 70 75 80 

Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp Arg 

85 90 95 

Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu Ala lie Ser 
100 105 110 

Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp 
115 120 125 

Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr Glu Gly Arg 
130 135 140 

Ala Val Asp lie Thr Thr Ser Asp Arg Asp Arg Ser Lys Tyr Gly Met 
145 150 155 160 

Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu 

165 170 175 

Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn Ser Val Ala 
180 185 190 

Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val His Leu Glu 
195 200 205 

« 

Gin Gly Gly Thr Lys Leu Val Lys Asp Leu Ser Pro Gly Asp Arg Val 
210 215 220 

Leu Ala Ala Asp Asp Gin Gly Arg Leu Leu Tyr Ser Asp Phe Leu Thr 
225 230 235 240 

Phe Leu Asp Arg Asp Asp Gly Ala Lys Lys Val Phe Tyr Val He Glu 

245 250 255 

Thr Arg Glu Pro Arg Glu Arg Leu Leu Leu Thr Ala Ala His Leu Leu 
260 265 onn 



28 



wo 01/98344 



PCT/USOl/19435 



Phe Val Ala Pro His Asn Asp Ser Ala Thr Gly Glu Pro Glu Ala Ser 
275 280 285 

Ser Gly Ser Gly Pro Pro Ser Gly Gly Ala Leu Gly Pro Arg Ala Leu 
290 , 295 300 

Phe Ala Ser Arg Val Arg Pro Gly Gin Arg Val Tyr Val Val Ala Glu 
305 310 315 320 

Arg Asp Gly Asp Arg Arg Leu Leu Pro Ala Ala Val His Ser Val Thr 

325 330 335 

Leu Ser Glu Glu Ala Ala Gly Ala Tyr Ala Pro Leu Thr Ala Gin Gly 
340 • 345 350 

Thr lie Leu lie Asn Arg Val Leu Ala Ser Cys Tyr Ala Val He Glu 
355 360 365 

Glu His Ser Trp Ala His Arg Ala Phe Ala Pro Phe Arg Leu Ala His 
370 375 380 

Ala Leu Leu Ala Ala Leu Ala Pro Ala Arg Thr Asp Arg Gly Gly Asp 
385 390 395 400 

Ser Gly Gly Gly Asp Arg Gly Gly Gly Gly Gly Arg Val Ala Leu Thr 

405 410 415 

JUa Pro Gly Ala Ala Asp Ala Pro Gly Ala Gly Ala Thr Ala Gly He 
420 425 430 

His Trp Tyr Ser Gin Leu Leu Tyr Gin He Gly Thr Trp Leu Leu Asp 
435 440 445 

Ser Glu Ala Leu His Pro Leu Gly Met Ala Val Lys Ser Ser Xaa Ser 
450 455 460 

Arg Gly Ala Gly Gly Gly Ala Arg Glu Gly Ala 
465 470 475 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Ser Pro Ala Arg Leu Arg Pro Arg Leu His Phe Cys Leu Val Leu 
15 10 15 

Leu Leu Leu Leu Val Val Pro Ala Ala Trp Gly Cys Gly Pro Gly Arg 
20 25 30 

Val Val Gly Ser Arg Arg Arg Pro Pro Arg Lys Leu Val Pro Leu Ala 
35 40 45 
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Tvr Lvs Gin Phe Ser Pro Asn Val Pro Glu Lys Thr Leu Gly Ala Ser 
50 55 60 

Glv Ara Tyr Glu Gly Lys He Ala Arg Ser Ser Glu Arg Phe Lys Glu 
65 70 75 80 

Leu Thr Pro Asn Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn 

85 90 95 

Thr Gly Ala Asp Arg Leu Met Thr Gin Arg Cys Lys Asp Arg Leu Asn 
100 105 110 

Ser Leu Ala He Ser Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg 
115 120 125 

Val Thr Glu Gly Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu 
130 135 140 

His Tyr Glu Gly Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Arg 
145 150 155 160 

' Asn Lys Tyr Gly Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp 

165 170 175 

Trp Val Tyr Tyr Glu Ser Lys Ala His Val His Cys Ser Val Lys Ser 
180 185 190 

Glu His Ser Ala Ala Ala Lys Thr Gly Gly Cys Phe Pro Ala Gly Ala 
195 200 205 

Gin Val Arg Leu Glu Ser Gly Ala Arg Val Ala Leu Ser Ala Val TVrg 
210 215 220 

Pro Gly Asp Arg Val Leu Ala Met Gly Glu Asp Gly Ser Pro Thr Phe 
225 230 235 240 

Ser Asp Val Leu He Phe Leu Asp Arg Glu Pro His Arg Leu Arg Ala 

245 250 255 

Phe Gin Val He Glu Thr Gin Asp Pro Pro Arg Arg Leu Ala Leu Thr 
260 265 270 

Pro Ala His Leu Leu Phe Thr TVla Asp Asn His Thr Glu Pro Ala Ala 
275 280 285 

Arg Phe Arg Ala Thr Phe Ala Ser His Val Gin Pro Gly Gin Tyr Val 
290 295 300 

Leu Val Ala Gly Val Pro Gly Leu Gin Pro Ala Arg Val Ala Ala Val 
305 310 315 320 

Ser Thr His Val Ala Leu Gly Ala Tyr Ala Pro Leu Thr Lys His Gly 

325 330 335 

Thr Leu Val Val Glu Asp Val Val Ala Ser Cys Phe Ala Ala Val Ala 
340 345 350 

Asp His His Leu Ala Gin Leu Ala Phe Trp Pro Leu Arg Leu Phe His 
355 360 365 
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Ser Leu Ala Trp Gly Ser Trp Thr Pro Gly Glu Gly Val His Trp Tyr 
370 375 380 

Pro Gin Leu Leu Tyr Arg Leu Gly Arg Leu Leu Leu Glu Glu Gly Ser 
385 390 395 400 

* 

Phe His Pro Leu Gly Met Ser Gly Ala Gly Ser 

405 410 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 aioino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Ala Leu Leu Thr Asn Leu Leu Pro Leu Cys Cys Leu Ala Leu Leu 
15 10 15 

Ala Leu Pro Ala Gin Ser Cys Gly Pro Gly Arg Gly Pro Val Gly Arg 
20 25 30 

Arg Arg Tyr Ala Arg Lys Gin Leu Val Pro Leu Leu Tyr Lys Gin Phe 
35 40 45 

Val Pro Gly Val Pro Glu Arg Thr Leu Gly Ala Ser Gly Pro Ala Glu 
50 55 60 

Gly Arg Val Ala Arg Gly Ser Glu Arg Phe Arg Asp Leu Val Pro Asn 
65 70 75 80 

Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Ser Gly Ala Asp 

85 90 95 

Arg Leu Met Thr Glu Arg Cys Lys Glu Arg Val Asn Ala Leu Ala lie 
100 105 110 

Ala Val Met Asn Met Trp Pro Gly Val Arg Leu Arg Val Thr Glu Gly 
115 120 125 

Trp Asp Glu Asp Gly His His Ala Gin Asp Ser Leu His Tyr Glu Gly 
130 135 140 

Arg Ala Leu Asp lie Thr Thr Ser Asp Arg Asp Arg Asn Lys Tyr Gly 
145 150 155 160 

4 

Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr 

165 170 175 

Glu Ser Arg Asn His Val His Val Ser Val Lys Ala Asp Asn Ser Leu 
180 185 190 

Ala Val Arg Ala Gly Gly Cys Phe Pro Gly Asn Ala Thr Val Arg Leu 
195 200 205 

TiTp Ser Gly Glu Arg Lys Gly Leu Arg Glu Leu His Arg Gly Asp Trp 
210 215 220 
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Val Leu Ala Ala Asp TVla Ser Gly Arg Val Val Pro Thr Pro Val Leu 
225 230 235 240 

Leu Phe Leu Asp Arg Asp Leu Gin Arg Arg Ala Ser Phe Val Ala Val 

245 , 250 255 

Glu Thr Glu Trp Pro Pro Arg Lys Leu Leu Leu Thr Pro Trp His Leu 
260 265 270 

Val Phe Ala Ala Arg Gly Pro Ala Pro Ala Pro Gly Asp Phe Ala Pro 
275 280 285 

Val Phe Ala Arg Arg Leu Arg Ala Gly Asp Ser Val Leu Ala Pro Gly 
290 295 ■ 300 

Gly Asp Ala Leu Arg Pro Ala Arg Val Ala Arg Val Ala Arg Glu Glu 
305 310 315 320 

Ala Val Gly Val Phe Ala Pro Leu Thr Ala H4s Gly Thr Leu Leu Val 

325 330 335 

Asn Asp Val Leu Ala Ser Cys Tyr Ala Val Leu Glu Ser His Gin Trp 
340 345 350 

Ala His Arg Ala Phe Ala Pro Leu Arg Leu Leu His Ala Leu Gly Ala 
355 360 365 

Leu Leu Pro Gly Gly Ala Val Gin Pro Thr Gly Met His Trp Tyr Ser 
370 375 380 

Arg Leu Leu Tyr Arg Leu Ala Glu Glu Leu Leu Gly 
385 390 395 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 416 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Asp Val Arg Leu His Leu Lys Gin Phe Ala Leu Leu Cys Phe lie 
15 10 15 

Ser Leu Leu Leu Thr Pro Cys Gly Leu Ala Cys Gly Pro Gly Arg Gly 
20 25 30 

Tyr Gly Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys 
35 40 45 

Gin Phe He Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Lys 
50 55 60 

Tyr Glu Gly Lys He Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu He 
65 70 75 80 



32 



wo 01/98344 



PCT/USOl/19435 



Pro 



Asn 



Tyr 



Asn 



Pro Asp He He 
85 



Phe Lys Asp Glu Glu Asn 
90 



Thr Asn 
95 



Ala Asp Arg Leu Met Thr Lys Arg Cys Lys Asp Lys Leu Asn Ser Leu 
100 . 105 110 

Ala He Ser Val Met Asn His Trp Pro Gly Val Lys Leu Arg Val Thr 
115 120 125 

Glu Gly Trp Asp Glu- Asp Gly His His Leu Glu Glu Ser Leu His Tyr 
130 135 140 

Glu Gly Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Lys Ser Lys 
145 150 155 160 

Tyr Gly Met Leu Ser Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val 

165 170 175 

Tyr Tyr Glu Ser Lys Ala His He His Cys Ser Val Lys Ala Glu Asn 
180 185 190 

Ser Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Gly Thr Val 
195 200 205 

Thr Leu Gly Asp Gly Thr Arg Lys Pro He Lys Asp Leu Lys Val Gly 
210. 215 220 

Asp Arg Val Leu Ala Ala Asp Glu Lys Gly Asn Val Leu He Ser Asp 
225 230 235 240 

Phe He Met Phe He Asp His Asp Pro Thr Thr Arg Arg Gin Phe He 

245 250 255 

Val He Glu Thr Ser Glu Pro Phe Thr Lys Leu Thr Leu Thr Ala Ala 
260 265 270 

His Leu Val Phe Val Gly Asn Ser Ser Ala Ala Ser Gly He Thr Ala 
275 280 285 

Thr Phe Ala Ser Asn Val Lys Pro Gly Asp Thr Val Leu Val Trp Glu 
290 295 300 

Asp Thr Cys Glu Ser Leu Lys Ser Val Thr Val Lys Arg He Tyr Thr 
305 310 315 320 

Glu Glu His Glu Gly Ser Phe Ala Pro Val Thr Ala His Gly Thr He 

325 330 ■ 335 

He Val Asp Gin Val Leu Ala Ser Cys Tyr Ala Val He Glu Asn His 
340 345 350 

Lys Trp Ala His Trp Ala Phe Ala Pro Val Arg Leu- Cys His Lys Leu 
355 360 365 

Met Thr Trp Leu Phe Pro Ala Arg Glu Ser Asn Val Asn Phe Gin Glu 
370 375 380 

Asp Gly He His Trp Tyr Ser Asn Met Leu Phe His He Gly Ser Trp 
385 390 395 400 
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Leu Leu Asp Arg Asp Ser Phe His Pro Leu Gly lie Leu His Leu Ser 

405 410 415 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1416 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1413 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATG GAT AAC CAC AGC TCA GTG COT TGG GCC AGT GCC GCC AGT GTC ACC 48 
Met Asp Asn His Ser Ser Val Pro Trp Ala Ser Ala Ala Ser Val Thr 
15 10 15 

TGT CTC TCC CTG GGA TGC CAA ATG CCA CA6 TTC CAG TTC CAG TTC CAG 96 
Cys Leu Ser Leu Gly Cys Gin Met Pro Gin Phe Gin Phe Gin Phe Gin 
20 25 30 

CTC CAA ATC CGC AGC GAG CTC CAT CTC CGC AAG CCC GCA AGA AGA ACG 144 
Leu Gin lie Arg Ser Glu Leu His Leu Arg Lys Pro Ala Arg Arg Thr 
35 40 45 

CAA ACG ATG CGC CAC ATT GCG CAT ACG CAG CGT TGC CTC AGC AGG CTG 192 
Gin Thr Met Arg His lie Ala His Thr Gin Arg Cys Leu Ser Arg Leu 
50 .55 60 * 

ACC TCT CTG GTG GCC CTG CTG CTG ATC GTC TTG CCG ATG GTC TTT AGC 240 
Thr Ser Leu Val Ala Leu Leu Leu lie Val Leu Pro Met Val Phe Ser 
65 70 75 80 

CCG GCT CAC AGC TGC GGT CCT GGC CGA GGA TTG GGT CGT CAT AGG GCG - 288 
Pro Ala His Ser Cys Gly Pro Gly Arg Gly Leu Gly Arg His Arg Ala 

85 90 95 

CGC AAC CTG TAT CCG CTG GTC CTC AAG CAG ACA ATT CCC AAT CTA TCC 336 
Arg Asn Leu Tyr Pro Leu Val Leu Lys Gin Thr lie Pro Asn Leu Ser 
100 105 110 

4 

GAG TAC ACG AAC AGC GCC TCC GGA CCT CTG GAG GGT GTG ATC CGT CG6 384 
Glu Tyr Thr Asn Ser Ala Ser Gly Pro Leu Glu Gly Val lie Arg Arg 
115 120 125 

GAT TCG CCC AAA TTC AAG GAC CTC GTG CCC AAC TAC AAC AGG GAC ATC 432 
Asp Ser Pro Lys Phe Lys Asp Leu Val Pro Asn Tyr Asn Arg Asp lie 
130 135 140 

CTT TTC CGT GAC GAG GAA GGC ACC GGA GCG GAT GGC TTG ATG AGC AAG 480 
Leu Phe Arg Asp Glu Glu Gly Thr Gly Ala Asp Gly Leu Met Ser Lys 
145 150 . ■ 155 160 
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CGC TGC AAG GAG AAG CTA AAC GTG CTG GCC TAC TCG GTG ATG AAC GAA 528 
Arg Cys Lys Glu Lys Leu Asn Val Leu Ala Tyr Ser Val Met Asn Glu 

165 170 175 

TGG CCC GGC ATC CGG CTG CTG GTC ACC GAG AGC TGG GAC GAG GAC TAC 576 
Trp Pro Gly lie Arg Leu Leu Val Thr Glu Ser Trp Asp Glu Asp Tyr 
180 185 190 

CAT CAC GGC CAG GAG TCG CTC CAC TAC GAG GGC CGA GCG GTG ACC ATT 624 
His His Gly Gin Glu Ser Leu His Tyr Glu Gly Arg Ala Val Thr lie 
195 200 205 

GCC ACC TCC GAT CGC GAC CAG TCC AAA TAC GGC ATG CTC GCT CGC CTG 672 
Ala Thr Ser Asp Arg Asp Gin Ser Lys Tyr Gly Met Leu Ala Arg Leu 
210 215 220 

GCC GTC GAG GCT GGA TTC GAT TGG GTC TCC TAC GTC AGC AGG CGC CAC 720 
Ala Val Glu Ala Gly Phe Asp Trp Val Ser Tyr Val Ser Arg Arg His 
225 230 235 240 

*ATC TAC TGC TCC GTC AAG TCA GAT TCG TCG ATC AGT TCC CAC GTG CAC 768 
lie Tyr Cys Ser Val Lys Ser Asp Ser Ser lie Ser Ser His Val His 

245 250 255 

GGC TGC TTC ACG CCG GAG AGC ACA GCG CTG CTG GAG AGT GGA GTC CGG 816 
Gly Cys Phe Thr Pro Glu Ser Thr Ala Leu Leu Glu Ser Gly Val Arg 
260 265 270 

AAG CCG CTC GGC GAG CTC TCT ATC GGA GAT CGT GTT TTG AGC ATG ACC 864 
Lys Pro Leu Gly Glu Leu Ser He Gly Asp Arg Val Leu Ser Met Thr 
275 280 285 

GCC AAC GGA. CAG GCC GTC TAC AGC GAA GTG ATC CTC TTC ATG GAC CGC 912 
Ala Asn Gly Gin Ala Val Tyr Ser Glu Val He Leu Phe Met Asp Arg 
290 295 300 

AAC CTC GAG CAG ATG CAA AAC TTT GTG CAG CTG CAC ACG GAC GGT GGA 960' 
Asn Leu Glu Gin Met Gin Asn Phe Val Gin Leu His Thr Asp Gly Gly 
305 310 315 320 

GCA GTG CTC ACG GTG ACG CCG GCT CAC CTG GTT AGC GTT TGG CAG C'CG 1008 
Ala Val Leu Thr Val Thr Pro Ala His Leu Val Ser Val Trp Gin Pro 

325 330 335 

GAG AGC CAG AAG CTC ACG TTT GTG TTT GCG CAT CGC ATC GAG GAG AAG 1056 
Glu Ser Gin Lys Leu Thr Phe Val Phe Ala His Arg He Glu Glu Lys 
340 345 350 

AAC CAG GTG CTC GTA CGG GAT GTG GAG ACG GGC GAG CTG AGG CCC CAG 1104 
Asn Gin Val Leu Val Arg Asp Val Glu Thr Gly Glu Leu Arg Pro Gin 
355 360 365 

CGA GTG GTC AAG TTG GGC AGT GTG CGC AGT AAG GGC GTG GTC GCG CCG 1152 
Arg Val Val Lys Leu Gly Ser Val Arg Ser Lys Gly Val Val Ala Pro 
370 375 380 

CTG ACC CGC GAG GGC ACC ATT GTG GTC AAC TCG GTG GCC GCC AGT TGC 1200 
Leu Thr Arg Glu Gly Thr He Val Val Asn Ser Val Ala Ala Ser Cys 
385 390 395 400 
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TAT GCG GTG ATC AAC AGT 
Tyr Ala Val lie Asn Ser 

405 

ATG CGC CTG CTG TCC ACG 
Met Arg Leu Leu Ser Thr 
420 

TTG CAC AGT TCG CCG AAG 
Leu His Ser Ser Pro Lys 
435 

ATC CAT TGG TAT GCG AAT 
He His Trp Tyr Ala Asn 
450 

CCG CAG AGC TGG CGC CAC 
Pro Gin Ser Tirp Arg His 
465 470 



CAG TCG CTG GCC CAC TGG 
Gin Ser Leu Ala His Trp 
410 

CTG GAG GCG TGG CTG CCC 
Leu Glu 7U.a Trp Leu Pro 
425 

GTG GTG AGC TCG GCG CAG 
Val Val Ser Ser Ala Gin 
440 

GCG CTC TAC AAG GTC AAG 
Ala Leu Tyr Lys Val Lys 
455 * 460 

GAT TGA 
Asp 



GGA CTG GCT CCC 1248 
Gly Leu Ala Pro 
415 

GCC AAG GAG CAG 1296 
Ala Lys Glu Gin 
430 

CAG CAG AAT GGC 1344 

Gin Gin Asn Gly 

445 

GAC TAC GTG CTG 1392 
Asp Tyr Val Leu 



1416 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Asp Asn His Ser Ser Val Pro Trp Ala Ser Ala Ala Ser Val Thr 
15 10 15 

Cys Leu Ser Leu Gly Cys Gin Met Pro Gin Phe Gin Phe Gin Phe Gin 
20 25 30 

Leu Gin He Arg Ser Glu Leu His Leu Arg Lys Pro Ala Arg Arg Thr 
35 40 45 

Gin Thr Met Arg His He Ala His Thr Gin Arg Cys Leu Ser Arg Leu 
50 55 60 

Thr Ser Leu Val Ala Leu Leu Leu He Val Leu Pro Met Val Phe Ser 
65 70 75 80 

Pro Ala His Ser Cys Gly Pro Gly Arg Gly Leu Gly Arg His Arg Ala 

85 90 95 

Arg Asn Leu Tyr Pro Leu Val Leu Lys Gin Thr He Pro Asn Leu Ser 
100 105 110 

Glu Tyr Thr Asn Ser Ala Ser Gly Pro Leu Glu Gly Val He Arg Arg 
115 120 125 

Asp Ser Pro Lys Phe Lys Asp Leu Val Pro Asn Tyr Asn Arg Asp He 
130 135 140 

Leu Phe Arg Asp Glu Glu Gly Thr Gly Ala Asp Gly Leu Met Ser Lys 
145 150 155 160 
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Arg Cys Lys Glu Lys Leu Asn Val Leu Ala Tyr Ser Val Met Asn Glu 

165 170 175 

Trp Pro Gly lie Arg Leu Leu Val Thr Glu Ser Trp Asp Glu Asp Tyr 
180 . 185 190 

His His Gly Gin Glu Ser Leu His Tyr Glu Gly Arg Ala Val Thr He 
195 200 205 

Ala Thr Ser Asp Arg Asp Gin Ser Lys Tyr Gly Met Leu Ala Arg Leu 
210 215 220 

Ala Val Glu Ala Gly Phe Asp Trp Val Ser Tyr Val Ser Arg Arg His 
225 230 ' 235 240 

He Tyr Cys Ser Val Lys Ser Asp Ser Ser He Ser Ser His Val His 

245 250 255 

Gly Cys Phe Thr Pro Glu Ser Thr Ala Leu Leu Glu Ser Gly Val Arg 
260 265 270 

Lys Pro Leu Gly Glu Leu Ser He Gly Asp Arg Val Leu Ser Met Thr 
275 280 285 

Ala Asn Gly Gin Ala Val Tyr Ser Glu Val He Leu Phe Met Asp Arg 
290 295 300 

Asn Leu Glu Gin Met Gin Asn Phe Val Gin Leu His Thr Asp Gly Gly 
305 310 315 320 

Ala Val Leu Thr Val Thr Pro Ala His Leu Val Ser Val Trp Gin Pro 

325 330 335 

Glu Ser Gin Lys Leu Thr Phe Val Phe Ala His Arg He Glu Glu Lys 
340 345 350 

Asn Gin Val Leu Val Arg Asp Val Glu Thr Gly Glu Leu Arg Pro Gin 
355 360 365 

Arg Val Val Lys Leu Gly Ser Val Arg Ser Lys Gly Val Val Ala Pro 
370 375 380 



Leu Thr Arg Glu Gly 
385 

Tyr Ala Val He Asn 

405 

Met Arg Leu Leu Ser 
420 

Leu His Ser Ser Pro 
435 

He His Trp Tyr Ala 
450 

Pro Gin Ser Trp Arg 
4 65 



Thr He Val Val Asn Ser 
390 395 

Ser Gin Ser Leu Ala His 

410 

Thr Leu Glu Ala Trp Leu 
425 

Lys Val Val Ser Ser Ala 
440 

Asn Ala Leu Tyr Lys Val 
455 

His Asp 
470 



Val Ala Ala Ser Cys 

400 

Trp Gly Leu Ala Pro 
415 

Pro 7U.a Lys Glu Gin 
.430 

Gin Gin Gin Asn Gly 
445 

Lys Asp Tyr Val Leu 
460 
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(2) INFORMATION FOR SEQ ID N0j.21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 221 amino acids 
{B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 

(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Cys Gly Pro Gly Arg Gly Xaa Gly Xaa Arg Arg His Pro Lys Lys Leu 
15 10 15 

Thr Pro Leu Ala Tyr Lys Gin Phe He Pro Asn Val Ala Glu Lys Thr 
20 25 30 

Leu Gly Ala Ser Gly Arg Tyr Glu Gly Lys He Xaa Arg Asn Ser Glu 
35 40 45 

Arg Phe Lys Glu Leu Thr Pro Asn Tyr Asn Pro Asp He He Phe Lys 
50 55 60 

Asp Glu Glu Asn Thr Gly Ala Asp Arg Leu Met Thr Gin Arg Cys Lys 
65 70 75 80 

Asp Lys Leu Asn Xaa Leu Ala He Ser Val Met Asn Xaa Trp Pro Gly 

85 90 95 

Val Xaa Leu Arg Val Thr Glu Gly Trp Asp Glu Asp Gly His His Xaa 
100 105 110 

Glu Glu Ser Leu His Tyr Glu Gly Arg Ala Val Asp He Thr Thr Ser 
115 120 125 

Asp Arg Asp Xaa Ser Lys Tyr Gly Xaa Leu Xaa Arg Leu Ala Val Glu 
130 135 140 

Ala Gly Phe Asp Trp Val Tyr Tyr Glu Ser Lys Ala His He His Cys 
145 150 155 160 

Ser Val Lys Ala Glu Asn Ser Val Ala Ala Lys Ser Gly Gly Cys Phe 

165 170 175 

Pro Gly Ser Ala Xaa Val Xaa Leu Xaa Xaa Gly Gly Xaa Lys Xaa Val 
180 . 185 190 

Lys Asp Leu Xaa Pro Gly Asp Xaa Val Leu Ala Ala Asp Xaa Xaa Gly 
195 200 205 

Xaa Leu Xaa Xaa Ser Asp Phe Xaa Xaa Phe Xaa Asp Arg 
210 215 220 



{2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: peptide 



(V) 



FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Cys Gly Pro Gly Arg Gly Xaa Xaa Xaa Arg Arg Xaa Xaa Xaa Pro Lys 
15 10 15 

Xaa Leu Xaa Pro Leu Xaa Tyr Lys Gin Phe Xaa Pro Xaa Xaa Xaa Glu 
20 25 30 

Xaa Thr Leu Gly Ala Ser Gly Xaa Xaa Glu Gly Xaa Xaa Xaa Arg Xaa 
35 40 45 

Ser Glu Arg Phe Xaa Xaa Leu Thr Pro Asn Tyr Asn Pro Asp lie lie 
50 55 60 

Phe Lys Asp Glu Glu Asn Xaa Gly Ala Asp Arg Leu Met Thr Xaa Arg 
65 70 75 80 

Cys Lys Xaa Xaa Xaa Asn Xaa Leu Ala lie Ser Val Met Asn Xaa Trp 

85 90 95 

Pro Gly Val Xaa Leu Arg Val Thr Glu Gly Xaa Asp Glu Asp Gly His 
100 105 110 

His Xaa Xaa Xaa Ser Leu His Tyr Glu Gly Arg TVla Xaa Asp lie Thr 
115 120 125 

Thr Ser Asp Arg Asp Xaa Xaa Lys Tyr Gly Xaa Leu Xaa Arg Leu Ala 
130 135 140 

Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu Ser Xaa Xaa His Xaa 
145 150 155 160 

His Xaa Ser Val Lys Xaa Xaa 



165 



39 



